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THE MECHANISM OF RHODOPSIN SYNTHESIS 
By RutH HUBBARD AND GEORGE WALD* 
BIOLOGICAL LABORATORIES OF HARVARD UNIVERSITY, CAMBRIDGE, Mass. 
Communicated December 22, 1950 


We have recently demonstrated the synthesis of rhodopsin from vitamin 
A, and opsin (rhodopsin-protein) in retinal homogenates and aqueous 
retinal extracts.'_ This process was shown to be aided by supplementation 
with homogenates of the pigment layers of the eye—the pigment epithelium 
and choroid—and with cozymase (DPN). A mechanism was proposed 
for rhodopsin synthesis, based upon the following considerations. 

(1) Vitamin A, is in equilibrium with retinene; through the retinene 
reductase system, in which cozymase functions as coenzyme.? The equilib- 
rium lies far over toward the side of reduction (vitamin A;), but can be dis- 
placed in the oxidative direction by the use of a retinene-trapping reagent 
such as hydroxylamine, which condenses spontaneously with retinene, to 
form retinene; oxime. In this process the endergonic oxidation of vitamin 
A, to retinene, is coupled with the exergonic trapping reaction.' 

(2) Retinene; condenses spontaneously also with opsin to form rhodop- 
sin.’ This exergonic process can serve as a retinene-trapping reaction in 
the retina. Rhodopsin may therefore be synthesized by the oxidation of 
vitamin A, to retinene, by the retinene reductase system, coupled with the 
condensation of retinene,; with opsin to form rhodopsin. 

These reactions are summarized in the following diagram: 
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In this system the oxidation of vitamin A, to retinene, is the limiting 
process. It is presumably here that DPN acts in promoting rhodopsin 
synthesis. 

In the experiments to be described we have examined systematically the 
effects on rhodopsin synthesis of factors which promote the oxidation of 
vitamin A;. We have analyzed in part the contribution of the pigment 
epithelium. Finally, we have assembled a minimal system of well-defined 
components, which synthesizes rhodopsin from vitamin A; in aqueous 
solution. 

Procedures.—The retinas used in these experiments were dissected from 
the eyes of dark adapted frogs (Rana pipiens) under dim red light. The 
pigment layers of the eye—the combined pigment epithelium and choroid 
layer——were collected separately. The retinas were bleached by exposure 
to an intense tungsten source, filtered through Corning glasses 3387 and 
3966 to remove heat, and violet and near ultraviolet radiation which might 
otherwise destroy vitamin A, and retinene,. The retinas were left for 1- 
1'/> hours in the light at room temperature, to allow all reactions to be 
completed. They were then colorless, all the retinene, liberated in the 
bleaching of rhodopsin having been reduced to vitamin Aj. 

Preparations for the synthesis of rhodopsin were incubated in the dark 
at room temperature, usually overnight. Rhodopsin was extracted with 2 
per cent crystalline digitonin in water, the digitonin acting as solubilizer. 
Digitonin was used also to bring vitamin A, or retinene, into aqueous solution. 

Digitonin extracts were examined for the presence of rhodopsin by meas- 
uring the absorption spectrum in darkness, bleaching in bright tungsten 
light filtered through yellow Corning glass 3384, and re-measuring the ab- 
sorption spectrum. The difference in absorption before and after bleaching 

the so-called difference spectrum—identifies rhodopsin and measures its 
quantity. All the figures in the present paper are such difference spectra. 

Opsin was prepared by methods similar to those used to make the purest 
preparations of rhodopsin.’ Cattle retinas were bleached to colorlessness, 
disintegrated mechanically, and the outer segments of the rods were iso- 
lated by centrifuging in 35 per cent sucrose solution. The rods remain in 
suspension, while the remaining retinal tissues are sedimented. The sus- 
pension in sugar solution is diluted, and the rods collected by centrifuging. 
They are tanned in 4 per cent alum, and washed repeatedly with water and 
neutral phosphate buffer. Sometimes they are lyophilized, and extracted 
with petroleum ether. The residue of these treatments is free from water- 
soluble and fat-soluble material, and most of its remaining proteins have 
been rendered insoluble by the alum treatment. From this residue opsin is 
extracted into clear, colorless solution with 2 per cent aqueous digitonin. 

Oxidative Factors: ‘‘Succinoxidéise.’’-We have proposed that cozymase 
stimulates rhodopsin production by acting in the retinene reductase system, 


f 
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in which it oxidizes vitamin A; to retinene,, and is itself reduced in the 
process. If this view is correct, an enzyme system which re-oxidizes cozy- 
mase should aid in rhodopsin synthesis. - 

The so-called ‘‘succinoxidase”’ of heart muscle is of this nature. It is a 
particulate complex that includes all the components needed to transfer 
hydrogen from cozymase to oxygen: cytochrome oxidase, the cytochromes, 
and riboflavine enzymes. In our experiments we have used the beef heart 


T 
Frog- retinal homogenate + DPN 


Frog-retinal homegerate * DPN 


nosdase 
ant pament /ayers 


Extinction 


Eatsnction 


9 


500 600 500 

Wavelength-™u Wa velength-my 
FIGURE 1 

Stimulation of rhodopsin synthesis with ‘‘succinoxidase’”’ and pigment layer homog- 
enate. Frog retinal homogenate supplemented with DPN, and incubated alone, with 
‘“‘succinoxidase,” with pigment layer homogenate, and with both “‘succinoxidase’”’ and 
pigment layer homogenate. Difference spectra of rhodopsin extracted after incubation 
are shown. Each preparation contained 4.5 retinas, 1.5 mg. DPN, and 0.002 M NaF 
in a final volume of 1.0 ml. Each portion of pigment layer homogenate contained 
pigment layers of 9 eyes. ‘‘Succinoxidase” was supplemented with cytochrome c, 
1.5 X 

FIGURE 2 

Stimulation of rhodopsin synthesis by vitamin A). Frog retinal homogenate supple- 
mented with DPN, and incubated alone and with added vitamin A;. Difference spectra 
of rhodopsin extracted after incubation are shown. 


preparation of Ball and Cooper,' supplemented with cytochrome ¢ in a 
final concentration of 10~° M. 

Figure 1 shows the results of one such experiment. Retinas of 9 frogs 
were isolated and bleached to colorlessness (vitamin A; and opsin). The 
retinas were homo zenized, supplemented with cozymase and divided into 
4 equal portions. These were incubated in the dark at room temperature 
in the following way: one diluted only with buffer, the second with ‘‘suc- 
cinoxidase,”’ the third with a homogenate of the pigment layers from 9 eyes 
and the fourth with both ‘‘succinoxidase’’ and the same amount of pigment 
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layer homogenate. The final volume of each mixture was 1.0 ml. After 
incubation, all preparations were centrifuged, and the solid residues were 
extracted with 0.65 ml. of 2 per cent aqueous digitonin. 

Figure | contains the difference spectra of these extracts. It shows 
that the addition of ‘‘succinoxidase’’ to the retinal homogenate increased 
the yield of rhodopsin 33 per cent, or 50 per cent in the presence of pigment 
layer homogenate. The latter roughly doubled the yield of rhodopsin, 
whether or not “‘succinoxidase’’ had also been added. These observations 
show that “‘succinoxidase’’ promotes the synthesis of rhodopsin from vita- 
min Aj, and that this effect is relatively independent of the action of the pig- 
ment layers. 

In the mechanism of rhodopsin synthesis that we have proposed, such 
respiratory factors as ‘‘succinoxidase’’ participate only indirectly, by re- 
oxidizing DPN-H,. In the presence of excess DPN, such factors may not 
be required at all. Probably for this reason we have found in some experi- 
ments that removal of oxygen or poisoning with 10~* M cyanide interferes 
relatively little with rhodopsin synthesis in retinal homogenates supple- 
mented with DPN, or with the promotion of this synthesis by added pig- 
ment layer homogenate. 

Addition of Vitamin A,.—Figure 2 shows the results of a typical experi- 
ment. Right and left retinas of 5 dark adapted grass frogs were dissected 
into separate vessels and were bleached to colorlessness. One set of retinas 
was homogenized with 0.45 ml. neutral phosphate buffer and one drop of 
cottonseed oil containing 0.5 per cent of the antioxidant, a-tocopherol; 
the other set with an identical mixture in which the oil contained also 0.07 
mg. vitamin A;. To each homogenate, 1.8 mg. DPN was added, and both 
were incubated for 12 hours in the dark. They were centrifuged, and 
rhodopsin was extracted from the solid residues with 0.65 ml. of 2 per cent 
aqueous digitonin. The difference spectra of these extracts are shown in 
figure 2. The added vitamin A, had increased the synthesis of rhodopsin 
about 65 per cent. 

We have found that a more effective way to add vitamin A; to such a 
system is to grind the oil containing the vitamin with a dry powder of lyo- 
philized retinas. The powder is afterward suspended in buffer solution, 
and is incubated in the dark. Such an experiment is described below (Fig. 4). 

Role of the Pigment Epithelium.—We have shown that the addition of 
pigment layer homogenate greatly increases the yield of rhodopsin in retinal 
homogenates, whether otherwise unsupplemented, or supplemented with 
DPN or “‘succinoxidase.”’ 

It might be supposed that the pigment layers themselves synthesize 
rhodopsin. We have, however, supplemented pigment layer homogenate 
strongly with vitamin A; and DPN, and could observe only a trace of rho- 
dopsin formation, no more than might be explained by contamination 
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with a few rods. Pigmeut layers have also been tested for the presence 
of retinene reductase by incubating with DPN and hydroxylamine, the 
latter to act as a retinene-trapping reagent; no evidence was obtained of 
oxidation of vitamin A, to retinene;. The pigment layers apparently do 
not form rhodopsin; they seem to lack opsin, and probably also retinene 
reductase. 
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FIGURE 3 
The fat-soluble factor of the pigment layers. Frog retinal homogenate supplemented 
with DPN was incubated alone, with a brei of lyophilized pigment layers, and with a 
similar brei of lyophilized pigment layers which had previously been extracted with 
petroleum ether. Difference spectra of rhodopsin extracted after incubation are shown. 
The fat-extracted pigment layers had lost most of their capacity to stimulate rhodopsin 
synthesis. 
FIGURE 4 


Synthesis of rhodopsin from extra-retinal vitamin A;. Lyophilized retinas were 
extracted with petroleum ether to remove vitamin A,, and were supplemented with 
DPN in neutral phosphate buffer. Equal portions were incubated alone, with lyo- 
philized pigment layers, and with vitamin A, in oil. The fat-extracted retinas make 
almost no rhodopsin. The rhodopsin formed in the other preparations is derived 
therefore from the vitamin A; added in oil or from the pigment layers. About 5 times 
as much vitamin A, was added in oil as was contained in the pigment layers. 


The effectiveness of the pigment layers in our experiments is due princi- 
pally to a fat-soluble factor. If pigment layers are extracted with pe- 
troleum ether, they lose most of their capacity to stimulate rhodopsin 
synthesis. This is shown in figure 3. 
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A retinal homogenate which had been supplemented with DPN was in- 
cubated in the dark (a) alone; (b) with 6.3 mg. of a powder of lyophilized 
pigment layers; and (c) with 6.3 mg. of the same pigment layer powder 
which had been extracted twice with petroleum ether. After 12 hours incu- 
bation, rhodopsin was extracted from all three portions with digitonin 
solution. Figure 3 shows the difference spectra of these extracts. The ad- 
dition of lyophilized pigment layers to the retinal tissue had tripled the 
yield of rhodopsin (b); but this effect was very much reduced on extraction 
with petroleum ether (c). The same result is obtained when the extraction 
is performed at — 15°C. 

The fat-soluble factor of the pigment layers is vitamin A). The pigment 
layers contain a large quantity of vitamin Aj, in this species about 4 ug. 
per eye, concentrated almost wholly in the pigment epithelium.’ A simple 
experiment shows that in our homogenates, vitamin A, is transferred from 
the pigment epithelium to the retinal tissue for rhodopsin synthesis. 

Dark adapted retinas of 8 frogs were bleached to colorlessness, lyophil- 
ized, ground to a powder, and extracted thoroughly with petroleum ether 
to remove vitamin A;. The dry powder was divided into three equal por- 
tions, and treated as follows: portion (a) was left unsupplemented; (b) 
was stirred with 0.2 ml. cottonseed oil containing 0.27 mg. vitamin A; and 
1 mg. a-tocopherol; and (¢) was mixed with a lyophilized powder of 12 
pigment layers. Each portion was stirred with 0.33 ml. phosphate buffer, 
pH 6.8, containing 1.2 mg. DPN, and was incubated for 12 hours in the 
dark. It was then extracted with 0.65 ml. of 2 per cent digitonin solution, 
and the difference spectra of these extracts were measured. 

Figure 4 shows the results of this experiment. From the very small 
synthesis of rhodopsin in (a), it is clear that extraction of the retinal tissue 
with petroleum ether had removed almost all vitamin A;. Therefore vir- 
tually all the rhodopsin in (6) and (c) must have been made from extra- 
retinal vitamin Ay. In (0) the vitamin A; was supplied as a solution in oil, 
and had resulted in a 40 per cent yield of rhodopsin. In (c) the vitamin A, 
for rhodopsin synthesis was supplied by the lyophilized pigment layers, 
and had resulted in a 25 per cent yield. It should be noted that the amount 
of vitamin A, added in oil in (6) was about 5 times as great as the pigment 
layers contained in (c). 

This experiment shows unequivocally that in homogenates the vitamin 
A, of the pigment epithelium is available for rhodopsin synthesis by the 
retinal tissue. It shows also that supplementation of retinal tissue with 
vitamin A, can substitute for supplementation with whole pigment layers. 
It should be recalled that these experiments are all conducted in the pres- 
ence of excess DPN. Under these circumstances it seems that the primary 
action of the pigment layers is to supply the retinal tissue with vitamin Aj. 
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This, however, is not their only contribution to rhodopsin synthesis. 
The activity of frog pigment layers is greatly reduced by heating for 2-3 
minutes at 100°C., though they still contain large amounts of vitamin Aj. 
From homogenized pigment layers, deep yellow, opalescent solutions can 
be prepared in neutral M/15 phosphate buffer, which strongly stimulate 
the synthesis of rhodopsin in retinal homogenates, though they contain very 
little vitamin A,. The activity of such solutions is destroyed by boiling 
for 2-3 minutes. They contain heat-labile factors, probably proteins, 
which aid in rhodopsin synthesis. Their yellow color is due principally to 
riboflavine. Their composition and action are being examined further.® 

It must not be forgotten also that in the intact eye the layer of rods and 
cones depends primarily upon the choroidal circulation for its exchanges 
with the blood, and receives oxygen and metabolites from this source by 
diffusion through the intervening pigment epithelium. In cold-blooded 
animals, the retina has no blood vessels, and except for the sparse hyaloid 
vessels, depends entirely upon the choroidal circulation. In mammals, the 
retinal blood vessels supply primarily the inner retinal layers, and the sen- 
sory epithelium still depends principally upon the choroid.’ These are 
added reasons why the normal functioning of the rods and the synthesis of 
rhodopsin in vivo demand close contact between the retina and the pigment 
epithelium. 

Synthesis of Rhodopsin in a System of Known Components.—If our as- 
sumptions concerning the mechanism of rhodopsin synthesis are valid, it 
should be possible to perform this synthesis in a system containing only 
four components: vitamin A), retinene reductase, cozymase and opsin. 

Of these substances, only retinene reductase is not yet available in 
highly purified condition. Bliss, however, has reported that the equilibrium 
between vitamin A; and retinene; is catalyzed by crude preparations of 
liver alcohol dehydrogenase, with DPN as coenzyme.* We have confirmed 
this observation, using the crystalline alcohol dehydrogenase of horse liver, 
prepared by Bonnichsen.? Dr. Warren Yudkin in our laboratory has 
found also that a homogenate of frog retinas, supplemented with cozymase, 
oxidizes ethyl alcohol. It is possible therefore that retinene reductase is 
identical with liver alcohol dehydrogenase. The relation between these 
enzymes is being examined further.” 

We have assembled in aqueous digitonin solution the four components 
which should constitute a minimal system for the synthesis of rhodopsin: 
vitamin Aj, concentrated from fish liver oils; crystalline alcohol dehydro- 
genase from horse liver; cozymase (Sigma, 90 per cent pure); and purified 
opsin, prepared as described above. As control, an identical mixture was 
prepared which lacked only the alcohol dehydrogenase. 

Figure 5 shows the results of such an experiment. In this instance the 
control mixture generated a small amount of rhodopsin, the source of which 
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is not yet clear. 
sized with high efficiency. 


Proc. N. A.S 


In/the complete mixture, however, rhodopsin is synthe- 
Indeed, the opsin used in this experiment, when 


mixed with excess retinene,, yielded almost exactly as much rhodopsin as 
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FIGURE 5 

Synthesis of rhodopsin in a solution of known 
components. The upper curve shows the 
difference spectrum of rhodopsin synthesized 
by incubating together: vitamin Ay, 0.35 mg. 
in 0.08 ml. cottonseed oil containing 0.5 per 
cent a-tocopherol; crystalline horse liver alco- 
hol dehydrogenase, 0.6 mg.; cozymase, 0.2 mg.; 
and frog opsin in 2 per cent digitonin. Final 
volume, 0.8 ml, in neuiral phosphate buffer. 
The lower curve shows the rhedopsin formed 
in an identical mixture lacking only the alcohol 
dehydrogenase. The difference spectra were 
measured in the presence of hydroxylamine, 
0.17 M, to block regeneration. The opsin 
used in this experiment, incubated separately 
with excess retinene,, yielded rhodopsin with a 
maximal density of 0.18, almost exactly the 
same amount as was synthesized here from 
vitamin A). 


was formed from vitamin A). 
It seems therefore that the 
yield of rhodopsin in this ex- 
periment was limited only by 
the amount of available opsin. 

Discussion.—The net result 
of our experiments is to con- 
firm the view that rhodopsin 
is synthesized by the oxidation 
of vitamin A; to retinene,, 
coupled with the condensation 
of retinene; with opsin. All 
factors known to promote the 
oxidation of vitamin A, aid in 
the synthesis of rhodopsin: 
vitamin A), cozymase and oxi- 
dative systems, such as ‘‘suc- 
cinoxidase,’’ which keep cozy- 
mase oxidized. The pigment 
epithelium has been shown to 
stimulate rhodopsin synthesis 
in our experiments primarily 
by supplying one of these fac- 
tors, vitamin Aj, to the retinal 
tissue. The water-soluble, 
heat-labile factor of the pig- 
ment epithelium is still to be 
identified, and its function 
determined. The proposed 
mechanism, however, is estab- 
lished unequivocally with the 
demonstration that rhodopsin 


is synthesized in solution in the system of known components just de- 


scribed. 


Is this the only pathway of rhodopsin synthesis? The only reply to this 
question possible at present is that the mechanism we have described may 
well be adequate, and that there is as yet no evidence of an alternative 


mechanism. 


The known reactions that compose the rhodopsin system can therefore 


be formulated as follows: 
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Rhodopsin 


Lumi-rhodopsin 


> -20°C 
Meta-rhodopsin 
retinene reductase | 


DPN-y 
VitaminA,+Opsin Ret inene, + Opsin 


{} DPN 


VitamunA, from os/dative 
pigment epithehum 
and circulation 


The right-hand portion of the diagram indicates that all the intermediate 
stages in bleaching may not be retraced when retinene, and opsin recombine 
to form rhodopsin. 

Of the four components from which the rhodopsin system has been as- 
sembled in vitro—vitamin Aj, retinene reductase or alcohol dehydrogenase, 
cozymase and opsin—only the last appears to be confined to the retina, 
and indeed to the outer segments of the rods. All the other components 
have a wider distribution in tissues, and were obtained in our experiments 
from extra-retinal sources. The specific factor that probably restricts the 
synthesis of rhodopsin to rods, and limits its quantity there, is opsin. 

It is important to note that the rhodopsin system, though isolable in 
vitro as we have described, maintains basic connections in vivo with the cir- 
culation and the general metabolism. Some years ago it was shown that 
during light adaptation in vivo, the retina loses vitamin A, by diffusion into 
the pigment epithelium and the blood; and that it recaptures vitamin A, 
during dark adaptation by binding it in non-diffusible form in rhodopsin." 
These interchanges, confirmed and reemphasized in the present experi- 
ments, connect the rhodopsin system with the metabolism of vitamin A; 
throughout the organism, and with a more remote circumstance, its nutri- 
tional supply. 

Through its dependence upon cozymase, the rhodopsin system is con- 
nected also with the general processes of retinal respiration and fermenta- 
tion; and these in turn depend upon the blood circulation for supplies of 
oxygen and metabolites. The retinal metabolism determines not only the 
oxidation-reduction state of cozymase, but probably must maintain its 
concentration by continuous synthesis, for cozymase appears to be under 
constant attack by cellular nucleotidases.'? The rhodopsin system, there- 
fore, though it can be isolated in solution with four essential components, 
makes multiple connections 1m vivo with the circulation and the general 
metabolism. 

Summary.—It was proposed earlier that rhodopsin is synthesized by the 
oxidation of vitamin A, to retinene,; by cozymase (DPN) and retinene 
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reductase, coupled with the condensation of retinene. with opsin. (rhodop- 
sin-protein) to form rhodopsin. The present experiments confirm this 
view. There is as yet no evidence for the formation of rhodopsin by an- 
other mechanism. 

Factors which promote the oxidation of vitamin A; by retinal homoge- 
nates, all are shown to aid the synthesis of rhodopsin. These include cozy- 
mase, vitamin A; and oxidative systems such as heart particle ‘‘succinoxi- 
dase”’ which keep cozymase oxidized. 

The pigment epithelium of the eye is shown to promote rhodopsin syn- 
thesis by supplying vitamin A, to the retinal tissue. The pigment layers 
of the eye contain also a heat-labile, water-soluble factor, still unidentified, 
that stimulates the synthesis of rhodopsin. Through cozymase, the rho- 
dopsin system is connected with the general retinal metabolism, and this 
in turn depends for supplies of oxygen and metabolites upon exchanges 
with the blood circulation of the choroid, by diffusion through the inter- 
vening pigment epithelium. 

A system has been assembled from four well-defined components, which 
synthesizes rhodopsin from vitamin A; in solution. It contains vitamin 
Aj, cozymase, opsin and crystalline liver alcohol dehydrogenase which sub- 
stitutes for retinene reductase. 


* This research was supported in part by a grant from the Medical Sciences Division 
of the Office of Naval Research. A preliminary account of most of these experiments 
was presented at the 18th International Physiological Congress in Copenhagen, August 
15-18, 1950 (Abstracts of Communications, p. 264). 
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POLYPLOIDY IN YEAST AND ITS BEARING ON THE OCCUR- 
RENCE OF IRREGULAR GENETIC RATIOS* 


By HERSCHEL ROMAN, DONALD C. HAWTHORNE AND HOWARD C, DouGLas 
DEPARTMENTS OF BOTANY AND MICROBIOLOGY, UNIVERSITY OF WASHINGTON, SEATTLE 


Communicated by G. W. Beadle, December 16, 1950 


The significance of irregular ratios, i.e., phenotypic ratios of 4:0, 3:1, 
1:3, and 0:4 in asci obtained from crosses which are expected to give 2:2 
segregations and commonly do so, has been the subject of considerable 
speculation in yeast genetics. Lindegren' has cited such irregularities as 
evidence that a gene can be affected by its allele when the two are present 
in the same cell. On this interpretation, the 4:0 and 3:1 asci are those in 
which the dominant gene has converted the recessive allele to the dominant 
condition; the 1:3 and 0:4 cases are indicative of the degradation of the 
dominant by the recessive allele. 

The gene-conversion hypothesis does not lend itself at the present time 
to critical test, and therefore depends for its validation on the exclusion of 
other explanations based on known genetic mechanisms. To this end 
Lindegren and his colleagues have shown that neither multiple factors 
nor orthodox mutation can provide a general explanation for irregular ra- 
tios. Mundkur? has demonstrated the inadequacy of the Winge and 
Roberts’ hypothesis that accounts for these ratios in terms of an extra 
mitosis in the formation of ascospores. Finally, Mundkur‘ has excluded 
polyploidy as a factor in this problem on the grounds that it will not account 
for all of the irregularities that have been found. 

The evidence presented below that polyploidy does in fact occur in yeast 
has prompted us to reappraise its possible role in the interpretation of ir- 
regular ratios. 

Evidence of Diploid A scospores.—Among 64 four-spored linear asci whose 
spores were tested for mating type, galactose fermentation, and growth 
habit in liquid medium, one ascus was found which exhibited an irregular 
ratio for each of these characters. The asci were obtained from a cross 
between two clones of Saccharomyces that were presumably haploid and of 
composition agf and aGF, respectively.’ Clones were grown'from the four 
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spores of the exceptional ascus (No. 55) and were designated A, B, C and 
D, in the order in which the spores occurred in the linear ascus. Each of 
the clones gave a negative reaction in mating-type tests with the standard 
a and a@ strains. Clones A, C and D all fermented galactose within 24 
hours after being placed in Durham fermentation tubes. Clone B failed 
to ferment galactose during the 96 hours of the test. All of the clones ex- 
hibited the dominant flaky character in liquid medium. 

The failure of the four clones to mate with the standards suggested the 
possibility that each clone consisted of cells heterozygous for the mating- 
type alleles. Evidence for this view was obtained when it was found that 
each of the clones proved to be self-sporulating and produced asci on gyp- 
sum slants. Twelve asci, three from each of the four clones, were dis- 
sected and tests were made for mating type, galactose fermentation, and 


TABLE I 


Tue ResuLTS oF Tests oF Asct OBTAINED FROM THE SELF-SPORULATING CLONES 55 
A, B, C anp D 


CLONE Ascus 
TESTED No. COMPOSITION OF SPORES 


aGF agF aGf agf 
aGf aGF i agF 
agf aGF aGf agF 
agF agf agF agf 
agk agf agF agf 
agF agf agf agF 
ie aGF aGf aGF 
aGF aGf aGf aGF 
aGf aGF aGf aGF 
agf agF aGF aGf 
agF aGf aGF agf 
aGf agf aGF 


flaky. Ten of the twelve had four viable spores and exhibited a 2:2 segre- 
gation for both mating type and flaky. In the remaining two asci, only 
three of the four spores survived, but in these also a segregation for a and 
a and for F and f was observed. With respect to galactose fermentation, 
the asci from Clones A and D segregated for G and g, the asci from Clone 
B gave non-fermenters only, and the asci from Clone C gave fermenters 
only (table 1). 

The results of these tests show that the spores of Ascus 55 had the fol- 
lowing composition: Spore A, aa Gg Ff; Spore B, aa gg Ff; Spore C, 
aaGG Ff; and Spore D, aa Gg Ff. Since the three loci gave no evidence 
of linkage in these experiments, there is good reason to infer a disomic 
condition for at least three chromosomes of the complement and it is 
probable that each of the spores was actually diploid. Thus the irregular 
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55A 1 
2 
3 i 
55B 1 
2 
3 
55C 1 i 
2 i 
3 | 
55D 1 i 
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ratios which were exhibited in Ascus 55 can be resolved in terms of an 
orthodox polyploid expianation. 

Discussion. The information at hand does not permit us to distinguish 
between two alternatives for the origin of Ascus 55. First, the cell from 
which the ascus arose may have been tetraploid, of composition aaaa 
GGgg FFff. On this basis, the compositions of the four spores of Ascus 
55 would be accounted for on the assumption of a normal meiosis in which 
a crossover between the locus of G and the centromere had occurred. The 
manner of origin of the tetraploid cell itself is suggested by the observations 
of Lindegren! that asci are occasionally found within clones of single mating 
type. These asci have been interpreted to be the result of “‘illegitimate”’ 
hybridization between cells of like mating type. Such fusions would give a 
certain proportion of aa and aa@ diploids and the mixing of the two clones by 
Lindegren’s method should produce tetraploids of the type suggested above. 

Second, the diploid spores may have been the consequence of an extra 
division in spore formation. Suppose, for example, that a diploid ascoge- 
nous cell had produced four nuclei of composition aGF, aGf, agF and agf, 
respectively, and that each nucleus had undergone a mitosis; the result 
would be eight nuclei, as follows: (1) aGF (2) aGF, (3) aGf, (4) aGf, (5) 
agF, (6) agF, (7) agf and (8) agf. The subsequent nuclear fusion of 1 X 3, 
2X 7,4 X 5and 6 X 8 would result in diploid spores of the kind encoun- 
tered in Ascus 55. The precedent for an extra division is furnished by our 
own observations and those of Winge and Roberts’ that asci are occasion- 
ally found with more than four viable spores. It should be noted that the 
fusion of nuclei of like mating type following the extra mitosis would pro- 
vide diploid mating clones from which tetraploid zygotes could be obtained. 

The foregoing discussion suggests the possibility that clones derived 
from haploid spores may consist of both haploid and diploid cells. A cross 
between two such clones of opposite mating type should produce, in addi- 
tion to diploid and tetraploid zygotes, a number of triploid zygotes. The 
relative proportions of the three classes would depend on their relative 
viabilities and on the time and frequency of diploidization within the parent 
clones. As indicated above, tetraploid clones could also be the result of 
the fusion of two diploid spores derived from the extra mitosis and triploid 
clones would be obtainable from these in further crosses. Spores obtained 
from the tetraploid and triploid lines should in turn give rise to aneuploid 
clones of various kinds. 

These polypioid situations make possible an array of genetic combina- 
tions that can furnish material from which all of the types of irregular ratios 
can be obtained. Consider, for example, some of the zygote combinations 
that could be expected from a cross between a galactose-fermenter clone 
and a non-fermenter. If each of the two clones consists of a mixture of 
haploid and diploid cells, zygotes would be produced of composition Gg, 
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GGg, GGgg and Ggg. The types of asci which each of these zygotes could 
produce are given in table 2, Each of the combinations would give asci 
with a 2:2 segregation for galactose fermentation. In addition, asci de- 
rived from GGg and GGgg cells should exhibit 4:0 and 3:1 ratios, the latter 
depending on a crossover which places G and g on the same dyad. 


The 1:3 and 0:4 ratios are to be expected from the Ggg clone. The 
1:3 ratio would be obtained as a consequence of the movement of two G-g 
dyads to the same pole following crossing over. To account for the 0:4 
ratio it is necessary only to assume that in yeast, as in many other organ- 
isms that have been investigated, chromosomal loss in meiosis is the con- 


TABLE 2 


Typrs or Asct ExpECcTED FROM A Cross BETWEEN A GALACTOSE-FERMENTER CLONE AND 

A NON-FERMENTER ON THE ASSUMPTION THAT EACH CLONE Contains BotH HAPLOID 

AND DipLorp Ce_its. THE Errect oF CHROMOSOMAL Loss IS SHOWN ONLY FOR THE 
Ggg GENOTYPE 


PHENO- 
TYPIC 
RATIO 
Zyoortic WITHIN 
GBNOTYPB Typss or Asc1-——-—-—— ASCUS REMARKS 


Gg ; 2:2 
GGgg 4:0 


3: 
GGg 


Crossover 
Crossover 


Crossover 

Crossover 

Loss of chromosome 
carrying G 


comitant of the polyploid condition. The loss of the chromosome carry- 
ing G would result in an ascus in which all four spores would produce non- 
fermenter clones. 


A polyploidy hypothesis can thus provide a formal basis for the inter- 
pretation of all of the irregular ratios that have been encountered in crosses. 
Moreover, the hypothesis is subject to test since it predicts a number of 
types of identifiable asci, some of which are given in table 2. The tests 
must be conducted under conditions which will permit the detection of 
illegitimate hybridization if it occurs. Unfortunately, the most direct 
evidence, from chromosomal counts, cannot as yet be obtained for lack 
of suitable cytological techniques. 


j 
GG GG gg 9:9 
1 Crossover 
GG GC , 
| GG Gg G 
G gg 3 
Gee Gg Gg ‘ 
Gg G ee 
GG ge 
g g 0 | 
| 
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There are, however, two aspects of the already published. data that bear 
on this question. First, there is the evidence that an irregular ratio for one 
character is usually accompanied by an irregular ratio for another within 
the same ascus. Thus among 48 four-spored asci in which irregular ratios 
occurred,' * 35 exhibited an irregularity for more than one character. It is 
perhaps also significant that the 48 asci were produced from crosses in 
which one parent was either a clone (CIA), obtained from an ascospore of 
Saccharomyces carlsbergensis, or a descendant of this clone. An interpre- 
tation of this information in terms of the gene-conversion hypothesis re- 
quires the somewhat difficult assumption that the capacity for conversion 
can affect several loci simultaneously in one pedigree but not in another; 
the clone CIA is regarded as a “converter”? parent which transmits the 
capacity for conversion to its progeny. 

The same evidence can be explained readily on the basis of polyploidy. 
The simultaneous occurrence of irregular ratios for two or more characters 
in an ascus would be expected if the zygote were polyploid. For example, 
zygotes of composition GGg FFf should produce a large proportion of asci 
with 4:0 ratios for both galactose fermentation and flaky. Moreover, the 
selection for lines which give a high frequency of irregularity could be inter- 
preted as a selection for the polyploid condition. It would follow from this 
that the CIA clone and some of its descendants were probably diploid, to 
take the simplest view. 

Second, the most striking and direct support for the polyploidy hypothe- 
sis comes from the observations that crosses which yield irregular ratios 
for other characters also produce mating type disturbances. By analogy 
with the segregations for G and g in table 2, we should expect asci in which 
one, two, or four spores are of composition da. These spores, according 
to our experience with Ascus 55, should give rise to clones that will not 
mate with either the a or a standards. Five asci have been reported! * ® 
in which all of the spores gave “‘sterile’’ clones. It is noteworthy that each 
of the five asci, like Ascus 55, exhibited irregular ratios for other characters 
as well. Mundkur‘ also makes reference to asci in which two of the four 
spores gave sterile clones. We ourselves have found an ascus in which 
only one of the spores was of this type. Tests of the clone derived from 
this spore have shown that the spore was da. The occurrence of sterile 
clones and their correlation with the aa genotype is convincing evidence 
of the role of polyploidy in the production of irregular ratios. 

The foregoing discussion indicates that the polyploidy hypothesis of- 
fers a basis for an interpretation of irregular ratios on conventional grounds 
and moreover that it is supported by the evidence at hand. Further in- 
vestigation is required to determine the extent to which other explanations 
may be necessary. 
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* This investigation received support from the Biological and Medical Fund of the 
State of Washington. 

1 Lindegren, C. C., The Yeast Cell, Educational Publishers Inc., St. Louis, 1949. 

? Mundkur, B. D., Current Science, 19, 84-85 (1950). 

5 Winge, O., and Roberts, C., Nature, 165, 157-158 (1950). 

* Mundkur, B. D., Ann, Missouri Bot. Gard., 36, 259-280 (1949). 

5 All of the crosses reported in this paper were between clones derived from single 
ascospores. We are indebted to Dr. C. C. Lindegren for the parent clones. The symbols 
refer to genes affecting the following characters: aand a, mating type; Gand g, galac- 
tose fermentation and non-fermentation (within 96 hours), respectively; F and f, flaky 
and free dispersion, respectively, in liquid medium. The flaky character was particularly 
clear-cut in DIFCO Yeast Nitrogen Base medium with 1°% glucose added. 

6 Mundkur, B. D., and Lindegren, C. C., Am. J. Bot., 36: 722-727 (1949) 


ON THE FUNCTIONAL EQUATIONS OF THE DIRICHLET SERIES 
DERIVED FROM SIEGEL MODULAR FORMS 


By RIcHARD BELLMAN 


DEPARTMENT OF MATHEMATICS, STANFORD UNIVERSITY 


Communicated by H. S. Vandiver, December 9, 1950 
The striking analogy between the formulas 
So = T(s)t-*, t > 0, Re(s) > 0, 


and 


n+1 1 


(: -* ') |T|-*, Re(s) > ~ (1b) 


where, in the second formula, the integration is over the space of positive 
definite Xn matrices, |X | represents the determinant of X, and 7 is a 
positive definite matrix, leads one to conjecture that there should be a 
theory connecting Siegel modular forms and Dirichlet series possessing 
certain types of functional equations, similar to that created by Hecke con- 
necting Dirichlet series and modular forms of one variable. 

The first step in the construction of such a theory has been taken by 
Maass,! who showed that from modular forms of the second degree one can 
derive an infinite set of Dirichlet series possessing functional equations. 
Contrary to what occurs in the Hecke theory, the derivation of the func- 
tional equation is no longer a simple preliminary step, but requires some 
detailed analysis. Since the proof given by Maass is rather complicated, 
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and applicable only in the case of modular forms of the second degree, we 
propose to present here a sketch of a simpler proof for the result of Maass, 
which possesses the merit of being able to be extended to the general case. 
Complete results will be presented subsequently. 

Let F(Z) be a modular form of the second degree, satisfying the equation 


F((AZ + B)(CZ + D)“) = |CZ + D|*F(Z) (2) 
for (AZ + B)(CZ + D)~' an element of the modular group of the second 
degree, and possessing the Fourier expansion, 


F(X) (3) 


where the summation is over all non-negative semi-integers, and the imag- 
inary part of X is positive definite. If we use the parametric representa- 
tion, 


+ y*)/y ux/y 
ux/y u/y 


for a positive definite 2 X 2 matrix, cf. Maass,' two domains of importance 
below are 


> 0.9 > 0, <x <o, (4) 


D: O<a sgh Yat +7 (5) 
A’ S25 + ¥ 2 1. 


Let us decompose F(X) into the sum of the two functions defined by 
F(X) = (6) 


\T|=0 
where the summation is over all non-negative semi-integers of rank 0 or 1, 
and 


(7) 


where the summation is over all positive semi-integers. Then, cf. Maass, 


D x>0 iT} e(T) 


where e(7)) is the number of elements of the class of 7, and the summation 
on the right is over all distinct classes of 2 2 positive-definite semi- 
integers. Using the case n = 2 of (lb), we obtain for Re(s) sufficiently 
large, 


f(s) = + = 
IT) fy| X|*[FGX) — (9) 


= — Fi(iX)))du, (|X| = u). 
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(Here, and in what follows, we write fj for f°, dxdy/y?.) 

At first glance, one is tempted to try the classical method of splitting the 
right-hand integral into two parts, one over 0 < X < 1, and the other over 
| < X, followed by a change of variable X = X,~', and an application of 
the modular relation of (2). The presence of the non-constant term F(X) 
effectively stymies this approach. 

To circumvent this difficulty, we use an alternative method based upon 
analytic continuation. Let us illustrate this method in its starkest sim- 
plicity by deriving the functional equation for the Riemann zeta-function. 
We have, for Re(s) > 1, 


I'(s) ¢(2s) = x. (10) 


Inverting by means of the Mellin inverse, shifting the line of integration 
to the left past the two poles at s = 0, and s = '/2, taking account of the 
residues at these points, and then re-inverting, we obtain for Re(s) < 0, 


Now applying the functional equation for the theta function, we obtain 
the functional equation for ['(s)¢(2s).. This method assumes that we have 
already demonstrated the analytic continuability of I'(s)¢(2s) and know its 


meromorphic character. All this can be obtained by relatively crude methods 
based upon the use of the modular property of the theta function. Simi- 
larly, below, we shall assume that the analytic properties we require have 
already been demonstrated, namely the analytic continuability, and the 
existence of a finite number of poles. 

From (9), we obtain by use of the Mellin inversion formula, 


[FUX) — Fiax)]| = fonds, (12) 
4 JC 


where C is a straight line, Re(s) = a > so. Shifting the contour to the left, 
past all the poles of f(s), finite in number, we have for Re(s) < s; < 0, 


4 [FUX) — F(iX)] = R(u) + (13) 
A 21 JO 


where R(u) is a function of the type yaw *, and the integral is 0(u") as 
u—0,u > 0, foranyn > —s,. 
On the other hand, we have, from (2), 


Ss(FGX) — FGiX)] = Sa — FGiX)] 
= u* — + u* (14) 


| 
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Using the inequality t7(7X) > u V tots Por (x, y) ¢ A, where 


ty 
T = > 0, (15) 


it is easy to demonstrate that the third integral in (14) is O(e ) for some 
¢> Oasu—0,u > 0. If we can show that the first two integrals are of 


n 
the forms > ¢,u~“*, it will follow that the sum of the first two integrals 
must equal R(u). Separating out the constant terms corresponding to 7° = 


0, and using the parametric representation of 2 X 2 semidefinite matrices, 
we find, using the techniques contained in equations (25) and (78) of 
Maass’ paper that the integrals have the required form. Thus, 


T>0 


Hence, using (13), for Re(s) < s;, we have 


f(s) = Fo(iX —'))du 


wrx 
= r l 


The change of variable ¥ = X,~'! and integration term-by-term yields 
the desired functional equation. 

The general series discussed by Maass, involving the automorphic solu- 
tions of the equation 


2 2) = 18 
(» + 4 +r ) u=0 (18) 


may be handled similarly. 


1 Maass, H., ‘‘Modulformen zweiten Grade und Dirichletreihen,’’ Math. Ann., Bd 
122, 90-108 (1950). 
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THE STRUCTURE OF ALTERNA TIVE DIVISION RINGS 
By R. H. Bruck AND ERWIN KLEINFELD 
DEPARTMENT OF MATHEMATICS, UNIVERSITY OF WISCONSIN 
Communicated by A. A. Albert, December 16, 1950 


We shall outline the main features of a paper of like title which will ap- 
pear elsewhere. Except where the contrary is expressly stated, we assume 
for the sake of brevity that the ring R: (a) is alternative: (xx)y = x(xy) 
and (yx)x = y(xx) for allx, yor R; (5) is not associative; (c) has no divisors 
of zero: if xy = 0, at least one of x, y is zero; (d) has characteristic not 2: 
if 2x = then x = 0. 

A linear function A(x, ..., x,) from R to R is called skew-symmetric 
if it is zero when two or more of the arguments x, ..., x, are equal. If 
A, (i = 1, ..., m) are subsets of R, h(Aj, ..., A,) denotes the set of ele- 
ments h(a,, ..., @,) with a, in A;. The commutator (x, y) = xy — yx and 
the associator (x, y, z) = (xy)z — x(yz) are skew-symmetric.' We show 
that the linear function f(w, x, y, 2), defined by 


(wx, 2) x(w, 2) (x, z)w + f(w, 2), (1) 


is skew-symmetric and satisfies the identities 


3f(w, x, y, 2) = (w, (x, y, 2)) — (x, (y, 2, w)) + (y, (2, w, x)) — 


(2, (w, x, ¥)), (2) 
S(w, x, y, 2) = ((w, x), y, 3) + (w, x, (y, )). (3) 


Formulas (1)~(3) are repeatedly used in the proofs. 

The nucleus N = N(R) consists of all m in R such that (n, R, R) = 0. 
The center C = C(R) consists of all ¢ in R such that (c, R) = (c, R, R) = 0. 

LemMA 1. N(R) = C(R). 

CoROLLARY. (a, b, R) = 0 implies (a, b) = 0. 

The proof may be deduced from the following facts: If (a, b, R) = 0, 
then f(a, b, R, R) = 0, by (1), and hence ((a, 6), R, R) = 0, by (3); that is, 
(a,b)eN. lin, nxe N,n # 0, then (x, R, R)n = 0, by two uses of (1), and 
hence x « N. 

Lemma 2. If R is generated by three elements x, y, 2, then (1) ax? — 
Bx + y = Ofor a, B,yeC(R),a #0; (it) (x,y) #0. 

Coro.tiary. In any R (satisfying (a) — (d)) (a, 6) = O implies (a, b, 
R) = 0. 

In (i) we take a = (x, y, 2), B = (x, y, 2)(x?, y, 2), y = (x, xy,2)% Using 
known identities we find that (a, x, y) = 0 and by extensive use of (1), (3) 
we show that a(a,x) = 0. Thus (a,.S) = (a, S,S) = OwhereS =x uyuz. 
It is now easy to verify that ae C. From these results we deduce without 
much computation that C contains 6 = (x’, y, 2)*, (vw + x*, y, z)? = a + 
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6 + 268, and (hence) 8. Further use of (1), (3) shows that ye C. The 
proof of (ii) is now trivial. (It is essential for (ii), and also for the next 
lemma, that characteristic 2 be excluded. ) 

Lemma 3. Jf a, b are elements of R such that (a, b) # 0, then k is in C(R) tf 
and only if (k, a) = (k, b) = 0. 

By the Corollaries to Lemmas 1, 2, (x, y) = 0 if and only if (x, y, R) = 0. 
In any alternative ring R with this property, define K as the set of all kin R 
such that (k, a) = (k, b) = 0, and S, as the set of all s in R such that (K, s) 
= 0. ThusC c K,a,beS. By extensive use of (1), (3) we obtain the 
identity (K, R, R) (S, S) = 0. Therefore, under the hypothesis of Lemma 
3, K = C. 

Lemma 4. very element x of R satisfies a quadratic equation ax*® — Bx + 
= Owitha, B, ye C(R), a £0. 

If the element x of R is not in C, then x is notin V, by Lemma l. Hence 
(x, y, 2) # Ofor some y, z. By Lemma 2 (i), ax? — Bx + y = 0, where, in 
particular, (k, x) = (k, y) = Ofork = a,8,y. By Lemma 2 (ii), (x, y) # 
0, and hence, by Lemma 3, a, 8, ye C. Also a # Oby Lemma 2 (i). On 
the other hand, if x is in C, take 8 = 2ax, y = x? for any non-zero center 
element a. 

In view of Lemma 4, C contains a non-zero element. We define R//C 
as the set of all formal fractions x/c, x e R, ce C, ¢ # 0, with: (i) x/c = 
y/d if and only if dx = cy; (ii) (x/c) + (y/d) = (dx + cy)/(ed); (iii) 
(x/c)(y/d) = (xy)/(ed). Then R//C satisfies our initial hypotheses (a) — 
(d). Moreover R//C has unit ¢/c and center F = C//C, the quotient field 
of C. We imbed R in R//C by making .the identifications x = (cx)/e. 
Clearly R//C is an alternative quadratic algebraic algebra (without divi- 
sors of zero) in the sense of Albert. Therefore we may state the main 
theorem: 

TuHeoreM A. [f R 1s a not-associative, alternative ring without divisors of 
zero and of characteristic not 2, then C(R) contains a non-zero element and R 
can be imbedded in R//C, which is an alternative division algebra of order 8 
(a so-called Cayley-Dickson algebra) over its center F = C//C. 

Corotiary |. If, in addition, C is a field, Ris a Cayley-Dickson algebra 
over C. 

CoROLLARY 2. Every alternative division ring of characteristic not 2 is 
either (1) a field or skewfield or (11) a Cayley-Dickson algebra over its center. 

Coro.iary 3. Every ordered alternative ring ts associative. 

For Corollary 3, note that if R is ordered then R has characteristic 0 and 
no divisors of zero. If C # 0 then R//C can be ordered by defining x/c¢ > 
0 if and only if cx > 0. If R is not associative we obtain a contradiction 
from Theorem A. More directly, since every subring of an ordered ring is 
ordered, we may assume that R satisfies the hypotheses of Lemma 2 and ob- 
tain a contradiction by observing that R//C contains a quaternion algebra. 
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The more detailed paper will also contain important applications of 
Theorem A to the theory of projective planes. We quote without explana- 
tion: 

THEOREM B. If a projective plane m satisfies the Theorem of the Complete 
Quadrilateral, then some coordinate ring R of m is an alternative division ring 
of characteristic not 2 (and conversely). Moreover, every coordinate ring of x 
is isomorphic to R; equivalently, the collineation group of w is transitive on 
quadrilaterals. And, finally, either x is Desarguesian or R is a Cayley- 
Dickson algebra over C(R). 

Turorem C. A linearly ordered projective plane which satisfies the 
Theorem of the Complete Quadrilateral is Desarguesian. 


' Zorn, M., ‘“‘Theorie der alternativen Ringe,’’ Abh. Math. Sem. Hamburgischen Univ., 
8, 128-147 (1931). 

2 Albert, A. A., ‘‘Absolute-Valued Algebraic Algebras,”’ Bull. Am. Math. Soc., 55, 763- 
768 (1949). 


HAUSDORFF MEASURE AND LEBESGUE AREA 
By HERBERT FEDERER 


DEPARTMENT OF MATHEMATICS, BROWN UNIVERSITY 


Communicated by O. Zariski, December 16, 1950 


1. Introduction.—Following the creation of modern integration theory 
by Lebesgue and Carathéodory, it was natural to extend the concept of 
area to arbitrary continuous surfaces. This was done in several intuitively 
satisfactory ways, each offering its own geometric and analytic interest. 
It is now possible to associate with every surface several numbers, each of 
which claims a good right to be called the area of the surface, but the rela- 
tionship between these numbers is far from obvious in the general case. 
Efforts to determine which equations or inequalities are satisfied by these 
several areas of a surface have greatly added to our knowledge of signifi- 
cant metric and topological properties of continuous surfaces. Thus the 
lack of uniqueness in the definition of surface area has inspired the creation 
of a comparative theory, whose main results are summarized in this note. 
Some of these results are explicitly or implicitly contained in the literature, 
while others are published here for the first time. 

2. Multiplicity Functions.—Suppose f maps X into Y and ye Y. We 
may attach various meanings to “the multiplicity with which f takes on the 
value y."’ The simplest interpretation is given by N(f, y), which is the 
number (possibly ©) of elements of {x|f(x) = y}. 

If a topology is distinguished on X, we define L’(f, y) to be the number 
(possibly © ) of components of }x| f(x) = y} ; and if X is a manifold with 
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boundary, such as a closed k-cell, we also consider V(f, y), the number 
(possibly ©) of those components of {x|f(x) = y}which do not meet the 
boundary of X. 

Furthermore, if Y is a compact locally connected subset of a k-dimen- 
sional manifold and Y is a k-dimensional Euclidean space, we define S(f, y) 
and M(f, y) in terms of homology theory as in [6]. 

3. Hausdorff Areas—We assume that k < n are positive integers, 
H,, , is k-dimensional Hausdorff measure over n-dimensional Euclidean 
space (see [5]), and that C,, , is the class of all continuous functions on a 
fixed closed k-cell into n-space. 

Motivated by the idea that the area of a mapping f « C,. , should be a 
modification of the measure of the range of f, which weights each value of 
f with the multiplicity with which it is taken on, we are led to consider 


N*(f) = SN(f, 
U*(f) = SU(f, wy, 
V*(f) = wy 


as the Hausdorff areas corresponding to the multiplicity functions N, U, 
V. 

If k = n, the multiplicity functions S and M are meaningful and //,, , 
is simply k-dimensional Lebesgue measure; in this case we can also define 
the areas 


S*(f) = S'SU, xy, 
M*(f) = SMC, ny. 


4. Integralgeometric Areas.—Another plausible method for associating 
an area with a k-dimensional surface in n-space is to form all the orthogonal 
projections of the surface into k-space, compute the area of each such pro- 
jection according to any one of the definitions applicable to the simpler 
case in which k = n, and then define the area of the surface as a suitable 
linear function of the areas of its projections. 

Let py, » be the function on n-space to k-space which maps each n-tuple 
(x1, X2, ..., ¥,) onto the k-tuple (x), x2, ..., x,). The most general orthog- 
onal projection of n-space onto k-space is a functional superposition of the 
form py, .R, where R is an element of the orthogonal group G, of n-space. 
Thus the projections of a function f « C,, , are the mappings (Py, 
Cy, x with ReG,. The postulate for orthogonal invariance of the linear 
operation used to define the area of f leads to integration with respect 
to the Haar measure ¢, of G,, normalized so that ¢,(G,) = 1. Using the 
coefficient 

ben = + 1)/2]- le + 1)/2)-T — k + 
we thus associate with fC, , the integralgeometric areas 
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N**(f) by, n° S N*(Dn, Rf 
U**(f) = by, n* U* (Pn, Rf)denR, 
V**(f) by, n* V* (Dn, )denR, 
S**(f) by, S S*(Pn, )denR, 
M**(f) by, S M* (Pn, rRf)denrR, 


corresponding to the multiplicity functions NV, U, V, S, M. 

5. Lebesgue Area.—A mapping f € C;, , is said to be a polyhedron if and 
only if the domain of f can be so triangulated that f maps each simplex of 
the triangulation baricentrically onto a rectilinear simplex in n-space. 
Every rectilinear simplex in n-space has an elementary k-dimensional 
measure, in the sense that its k-dimensional Hausdorff measure is comput- 
able from its vertices by determinants. When applied to polyhedra, all 
reasonable definitions of area yield the same value, computable by as- 
sociating with each simplex of a suitable triangulation the elementary k- 
dimensional measure of its image in m-space and adding all the numbers so 
obtained. 

The class of polyhedra is dense in C;, ,, relative to the topology of uni- 
form convergence. For any f € C,, ,, we may therefore define L(f), the 
Lebesgue area of f, as the lower limit of the areas of polyhedra approxi- 
mating f. 

6. Comparison of Areas.—Suppose f ¢ C,, , and the domain of f is the 
set of all k-tuples (x1, x2, ..., %,) such that 0 < x, < 1 fori = 1,2, ... ,k. 
The various areas of f are related as follows: ; 

(1) N*(f) > UMP) = 

(2) N**(f) = U**(f) > V**(f) a S**(f) < M**(f) < L(f). 

(3) If k = n, then N*(f) = N**(f) > U*(f) = U**(f) > V*(f) = 
V**(f) > S*(f) = S**(f) < M*(f) = M**(f) = L(f). 

(4) N*(f) = N**(f), U*(f) 2 U**(f), V*(f) 2 V**(f). 

(5) If the range of f is (HZ, », R) rectifiable (see [5]), then N*(f) = 
N**(f) and U*(f) = U**(f). 

(6) If N*(f) = N**(f) < o or U*(f) = U**(f) < o, then the range 
of f is (Ty, », rectifiable. 

(7) If {y| > 0} is », rectifiable, then V*(f) = V**(f). 

(8) If V*(f) = V**(f) < o, then {y|V(f, y) > O} is (Hy, », R)rectifi- 
able. 

(9) If f satisfies a Lipschitz condition, which means that there is a 
number A such that 

— f(y)| A-|x — y| for x, y domain f, 
then N*(f) = U*(f) = V*(f) = N**(f) = U**(f) = V**(f) = S**(f) = 
M**(f) = L(f). 

(10) If k = 1, then N*(f) = U*(f) = V*(f) = N**(f) = U**(f) = 

v**(f) aes S**(f) = M**(f) = L(f). 
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(11) Ifk = 2, then S**(f) = M**(f). 
(12) Ifk = 2andL(f) < then M**(f)= Lif). 
(13) Ifk = 2andn = 3, then M**(f) = L(f). 


(14) If Hy, (range f) = 0, then by, a-L(f) < (7) 


(15) If U**(f) < @, then by, < (7) 


(16) Ifk = 2and H,, ({y|V(f, y) > 0}) = 0, then M**(f) = L(f). 

(17) Ifk = 2and V**(f) < ©, then M**(f) = L(f). 

(18) If k = 2, = 3 and f is a non-parametric surface, which means 
that 


bs, 2lf(x)] = x for x domain f, 


then N°(f) = U°(f) = V°C(f) = NPM f) = = Sf) = 


M**(f) = L(f). 
(19) Ifk = 2andf isa saddle surface, which means that 


sup glf(x)] = sup glf(x)] 
xeA xeB 


whenever A is an open subset of the domain of f, B is the boundary of A and 
gis a numerically valued linear function on n-space, then V*(f) = V**(f) = 
S**(f) = M**(f) = L(f). 

The inequalities in (1) and (2), except for the last one which involves 
L(f), follow from the corresponding inequalities for the multiplicity func- 
tions. 

Since M is a lower semicontinuous function, so are M* and M**, by 
Fatou’s lemma. Furthermore it may be verified directly from the defini- 
tions that M**(f) = L(f) whenever f isa polyhedron. These facts imply 
the last inequality in (2). 

In (3), only the equation M*(f) = L(f) isnot obvious. It was proved in 
[6]. 

The propositions (4), (5), (6), (7) and (8) are obtained by applying the 
fundamental structure theorems for sets of finite Hausdorff measure (see 
[5], Theorem 9.7) to the definitions of Hausdorff area. 

Reference to [4] and [5] yields the equation N*(f) = S**(f) = L(f) for 
Lipschitz mappings f. Hence (9) follows from (1), (2) and (4). Further- 
more (10) may be derived from (9) using parametrization by arc length. 

If k = n = 2, then {y|S(f, y) < M(f, y)}is countable (see [3]). This 
implies (11). 

The proposition (12) is obtained from [4], [7], [8] and [9]. Ifn = 3 
and M**(f) < o, it follows from [2] that L(f) < ©. Consequently (13) 
may be deduced from (12). These arguments may be generalized to yield 
(14), (15), (16) and (17). 
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A proof of (18) is contained in the last section of [5]. It appears that 
much of the technique employed there could be generalized to higher di- 
mensions. 

Extending certain methods developed in [1], it is possible to show that 
L(f) = V*(f) for every saddle surface f ¢ C, ,. If in addition n = 2, then 
V*(f) = S*(f), by virtue of (3) and (11). 

In attempting to prove (19) for a general saddle surface f ¢ C, , one may, 
in view of (2), assume that S**(f) < ©. For each RG, the projection 
Pn Rf € C2, 2 is a saddle surface, hence V*(p,, 2Rf) = S*(p, 2Rf). By in- 
tegration over G,, V**(f) = S**(f) < . Reference to (17), (11) and (4) 
completes the proof of (19). 

In view of the preceding results, the relations between the various 
Hausdorff and integralgeometric areas are now well understood.  Ex- 
amples exist to show that we have listed all inequalities which involve these 
areas and which are valid for arbitrary continuous surfaces. Corresponding 
to each inequality, we know in addition a geometric criterion of measure 
theoretic, point set topological or combinatorial nature which characterizes 
those surfaces for which equality holds. 

Among the Hausdorff and integral geometric areas the function M** 
appears at present to be most closely related to the Lebesgue area L. 
The functions M** and L agree for all two-dimensional surfaces in three 
space and, in the absence of counterexamples, one is tempted to conjec- 
ture that the equality holds in general. This wish is natural because up to 
now the theories of Hausdorff areas and of Lebesgue area have been con- 
nected only by the intermediate consideration of the integralgeometric 
area M**. However, future research may produce a new multiplicity 
function with a corresponding Hausdorff area more directly related to the 
Lebesgue area. 

! Besicovitch, A. S., “On Surtaces of Minimum Area,” Proc. Cambridge Phil. Soc., 44, 
313-334 (1948). 

2 Cesari, L., ‘‘Caratterizzazione analitica delle superficie continue di area finita sec- 
ondo Lebesgue,’’ Ann. R. scuola normale super. Pisa, §.2, 10, 2538-294 (1941) and 11, . 
1-42 (1942). 

§ Cesari, L., “Sui punti di diramazione delle trasformazioni continue e sull’area della 
superficie in forma parametrica,”’ Univ. Roma e Ist. Naz. Alta Mat. Rend. Mat. e Appl., 
$.5, 3, 37-62 (1942). 

Federer, H., ‘Coincidence Functions and Their Integrals,’ Trans. Am, Math. Soc., 
59, 441-466 (1946). 

6 Federer, H., ‘The (g, k) Rectifiable Subsets of n Space, [bid., 62, 114-192 (1947). 

Federer, H., ‘Essential Multiplicity and Lebesgue Area,’”’ Proc. Nati. Acap. Scr., 
34, 411-416 (1948). 

7 Morrey, C. B., Jr., “An Analytic Characterization of Surfaces of Finite Lebesgue 
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®’ Radé, T., “On Continuous Mappings of Peano Spaces,”’ Trans. Am. Math. Soc., 
58, 420-454 (1945). 
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ON THE STRUCTURE OF EXTREMAL POLYNOMIALS 
By MICHAEL FEKETE 
HEBREW UNIVERSITY, JERUSALEM * 
Communicated by J. von Neumann, November 1, 1950 


I. Introductory.—1. The concept of extremal polynomials on a given 
pointset of the complex plane was introduced and their properties were first 
studied in a joint paper! on the location of the zeros of certain minimum 
polynomials* by M. Fekete and J. von Neumann. 

2. The present note’s aim is a realization, by one of these authors, of 
their intention, now some three decades old, to deduce from the definition 
of the extremal polynomials a representation for all extremal polynomials of 
given degree n, (a) when the set S to which the extremal polynomials belong 
is an arbitrary bounded, finite or infinite closed pointset in the complex 
plane (consisting, if finite, of m + 1 points at least); (b) when the set S and 
together with it also the extremal polynomial p(z) to be represented exhibit 
some properties of symmetry (e.g., S is symmetric in the x-axis and p(z) has 
real coefficients only). 

3. The representation afforded here will put into evidence anew two 
simple (also directly derivable) laws about the distribution of roots of ex- 
tremal polynomials in relation to the set S to which these polynomials be- 
long—laws which not only remind us of the classical propositions of Lucas 
and Jensen on the location of the zeros of the derivatives, in relation to the 
zeros of the original polynomials, but also contain those results as particular 
cases. Furthermore, the representation enables one to give refinements of 
the above laws, similar to those obtained by Professor J. L. Walsh, in his 
most remarkable investigations on critical points,‘ under additional 
assumptions about some kind of symmetry of the zeros of the given poly- 
nomials. This holds good in cases where the assumptions of Walsh do not 
imply any condition on the number of zeros considered, but refer to their 
location only. 

4. Weshall conclude this introductory by recalling the definitions of the 
underlying concepts (with their immediate consequences) and by indicating 
also the connections between extremal polynomials and some minimum 
polynomials? considered by Professor Fejér whose researches suggested our 
investigations on extremal polynomials and whose discoveries in this field 
gave the impulse and served for guidance in shaping aims as well as 
methods. 

5. Given a bounded, finite or infinite-closed pointset S of the z-plane 
and a polynomial A(z) = 2" + a2""! + ... + a, of degree n with the lead- 
ing coefficient 1, we call another polynomial B(z) = 2" + bz"~!+ ... +b, 
of the same degree and with the same leading coefficient an under polynomial 


= 

| 

if 95 
i 

4 

; 


96 MATHEMATICS: M. FEKETE Proc. N. A. S. 


of A(z) on S when |B(z)| < |A(z)| for each z ¢ S except the zeros ze S of 
A(z)—if such there be—at which also B(z) vanishes. 

A polynomial C(z) = 2” + 2"-! + ... + c, is called an extremal poly- 
nomial on S if it has no underpolynomial B(z) = 2* + bz"! + ... + by 
on S. 

6. An immediate consequence of these definitions is that if S is a set of n 
points, say 2, %, ..., 2,, then the unique extremal polynomial on S whose 

n 


degree is n is II(z — 2). Furthermore, by a result of Fejér (‘‘No root of an 


extremal polynomial C(z) = 2" + q2"~! + ... c, on S can lie outside the 
convex hull of S provided that this set consists of at least m points,’’), all 
extremal polynomials of degree m on a convex set S not a single point have 


n 
the form II (z — z,) with z,e Sfor1 <u <n. 
ual 


7. If Sis neither reduced to n points nor is a convex set and if C(z) is 
an extremal polynomial on S with some of its zero points not in S then C(z) 
admits a decomposition into two factors, say C*(z) and C**(z), the first 
differing from 0 throughout S and the second having all its zero-points in S. 
Then C*(z) is an extremal polynomial on S of degree < n. 

8. The results of the foregoing two sections clearly indicate that the 
problem of representation can be restricted lo the case of extremal polynomials 
differing from 0 on S. 

9. Given a bounded, finite or infinite-closed pointset S of the z-plane, 
one can associate to every polynomial A(z) = 2" + a2"-! + ... + d,,inan 
arbitrary but definite way, a non-negative finite number D = D{A(z); S}, 
as a deviation of A(z) on S (from 0). Such a deviation is called, in accord- 
ance with Fejér, a monotone deviation if D decreases when A(z) is replaced 
by its underpolynomials B(z) (if such there be).? 

10. Only extremal polynomials on S can minimize any given monotone 
deviation D; thus, the minimum polynomials of such deviations are neces- 
sarily extremal polynomials.’ Conversely, if C(z) = 2" + a2"-! + ... + 
¢, is an extremal polynomial on S, differing from 0 throughout S then C(z) 
is a minimum polynomial of the (generalized) ‘‘Tschebyscheff deviation”’ 
D{A(z); S} = ae sh with the weight-function w(z) = car which 
is clearly a monotone deviation. Thus the representation of extremal poly- 
nomials we are aiming at in the sense of §8 will settle the problem of char- 
acterization for all minimum polynomials which differ from 0 throughout S 
on which the monotone deviation is defined. 

II. The Representation of Extremal Polynomials in the General Case.— 
1. We start this part of our note with the formulation of our (main) 

TuHEeorEM I: Let S be a bounded, finite or infinite pointset of the z-plane, 
consisting of at least n + 1 points when finite and being closed when infinite. 
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Let C(z) = + +c, bean extremal polynomaal on S different 
from 0 throughout S. Then there are m + 1 points, say 2, 2, ..., 8m, in S 
and m + 1 positive constants, say Xo, 1, ..., Am Satisfying the conditions 


m & 2n, 2, A, = | such that C(z) admits the representation: C(z) = 


A(z) 

M(z) 

n and with leading coefficients equal to 1, given by the formulas 
6 

nia) = ana) = ate) = - 


where A(z) and M(z) are polynomials of the respective orders mand m — 


2. We rely in the verification of the theorem just formulated upon the 
following 

Lemma: Let S bea bounded, finite or infinite-closed pointset of the complex 
z2=x + ty plane. Let p(z) = 2" + pis"! + ... + pp be a polynomial 
different from O throughout S. Let us associate to S and p(z) a pointset 2, = 
Len(S; p(z)) of the Euclidean space Ex, of 2n dimensions, consisting of the 
points with the coordinates x, = R(z"~'p(z)), v1 = I(2"~*p(z)), ...,X%, = 
R(p(z)), ¥n = I(p(z)) when 2 ranges over the set S. 

A necessary and sufficient condition that p(z) be an extremal polynomial on S 
1s that the convex hull T of 22, contains the origin O of Ean. 

3. The proof of this lemma can be based on the following fact. Let 
p(z) = + + ... + OforzeS; then p(z) possesses an under- 
polynomial g(z) = 2” + qz + ... + q, on S if and only if there is a poly- 
nomial 


r(z) = + ... +H, (1) 
of degree n — | satisfying the inequality 
\p(z) — r(z)| < |p(z) + r(z)| whenever ze S. (2) 


4. The fact just mentioned is an immediate consequence of the defini- 
tion of underpolynomials of p(z) on S. Indeed, when 


iq(z)| < |p(z)| for every S, 
then for r(z) = p(z) — q(z) = (bi — q1)2""1 + ... + (Pa — Qn) the relations 


+ r(2)| = = |2 - > Ip(2)| x 


( ple > |p(e)| > = — 


will clearly hold everywhere on S. Conversely, if there is a polynomial 
r(z) of the type (1) which satisfies (2) then g(z) = p(z) — ar(z) surely 


| 
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furnishes an underpolynomial of p(z) on S provided that a@ satisfies the 
conditions 
p(z) 


r(z) 


<a < Min. (3) 


zeS F 


2) 


> O whenever ze Indeed 


(Remember: by (2) we have Ra 


r(z) | € r(z) 


Ir(z) ? r(z) 


5. From the fact just established the truth of the lemma easily follows: 

1°. The Condition Is Sufficient: For if 0 belongs to T, then p(z) ts an 
extremal polynomial on S. Otherwise the inequality (2) would hold for 
some r(z) = 712"-! + ... + 7,, and consequently, we would have 


R(p(z))R(r(z)) + I(p(z))I(r(z)) > O whenever z « S, (4) 


which can be written in the form 


+ +... + + 
t,l(p(z)) > O (5) 


if we put 7 = 5) — th, ...,7n = Sn — thn. 
Now (5) indicates that the half-space 


contains the pointset 22, and, therefore, also its convex hull ['; but its 
boundary, the hyperplane + + .-. + + = 0, passes 
through the origin 0 of /2,. Hence 0 has a positive distance from Tl. Thus 
the sufficiency of the condition is established. 

6. 2°. The Condition Is Also Necessary: Indeed, if 0 is exterior to T° 
then there is a hyperplane in /%:, which separates 0 from I and, therefore, 
there exists a half-space (6) which contains [ and, consequently, also 22, in 
its interior; that is, (5) is fulfilled when zg ranges over S and thus (4) and (2) 
also hold when r; = 5s; — th, ...,% = Sn — tt, and r(z) is defined by (1). 
But (2), as we know, implies the existence of underpolynomials g(z) of p(z) 
on S. Thus the necessity of the condition is established and the proof of 
our lemma is complete. 


| 
| 
4 
| 


Vov. 37, 1951 MATHEMATICS: M. FEKETE 99 


7. The gap between this lemma and our main proposition can now be 
bridged by the following 
THEOREM II: Let S be a pointset as described in the lemma but consisting 
at least of n + 2 points. Let C(z) = 2" + q2""-! + ... + €, an extremal 
polynomial on S which differs from O when z ranges over S. Then there are 
m + 1 points in S, say 2, 21, ..., 2m withn & m & 2n, and m + 1 positive 
constants, say Xo, Mr, .--, Am with a A, = 1 such that 


m Zu 
dy Cls,) Ofora=0,1,...,” l, (7) 

and C(z) ts an extremal polynomial also on the subset s = | 2, 21, ..., Zm} of S. 

8. This theorem is an easy consequence of our lemma when we apply to 
it Carathéodory’s well-known theorem on the convex hull of arbitrary 
pointsets in a Euclidean space. In fact, by the lemma the convex hull I of 
Y2,(S, C(z)) necessarily contains the origin 0 of #2, when C(z) is an ex- 
tremal polynomial on S, different for 0 throughout S; hence, by Cara- 
théodory’s theorem, 0 is a centroid of m + 1 well-chosen points of Y2,(S, 
C(z)) with | < m < 2n. This fact is equivalent with the existence of m + 1 
points 2, ..., 2, in Sand m + 1 positive constants ko, ..., R» satisfying the 
equations 


k,R(2,°C(2,) => k,1(2,°C(s,)) = 0 fora = 0,1, ...,” — 1. 
Hence 


C(z,) 
which is the same as (7) when we put A, = &,{C(z,)|?: SO k,/C(z,)|%. In 


order to verify m 2 n we remark that, by (7), 0 necessarily belongs to the 
convex hull of the set 2»,(s,C(z)) associated with s in the same way that 
Y2n(.S,C(z)) is associated with S. Thus, by our lemma, C(z) is an extremal 
polynomial on s and, therefore, s cannot consist of fewer than n + 1 points; 
m 
otherwise, each polynomial with leading term 2” and divisible by II (z — z,) 
would yield an underpolynomial of C(z) on s,; this completes the proof. 
9. With the aid of this result we can establish the truth of our main 
theorem: Consider the polynomial 


C(s,) — 3, 


N(z) = 


| 
| 
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where 2, 21, ..., Ao, Ai, ---, Am are the numbers associated with C(z) in 


the sense of Theorem II and g(z) = II (zg — 2%). By (7) we clearly have 
r=0 


m >} n 
> oom = 1, thus N(z) is a polynomial of degree m with leading coefficient 
1. A further application of (7) leads to the fact that the polynomial N(z) 
has the divisor 2". (Consider the derivatives N (z) of N(z) of order B = 
- 0,1,...,” — Land apply (7) in the evaluation of V® (0)). Now, in view of 
(8) and the divisibility of N(z) by 2", we obtain 
s* C(s,) 3 — 8, (2) 
whence 
C(z)M(z) = Au =g"*+... 


This completes the proof of our main theorem. 
10. Remark. The polynomial 


A(z) = > Ay +. 


u=0 


because of the conditions \, > 0, > \, = Lis an extremal polynomial on 
the set s = {20, 1, ..., 2m}. In fact, the existence of an underpolynomial 
= 

Lig 
(2,) Z— 
would imply 
L(z,) A(2,) m 
"(2,) > g'(z,) 


u=0 u=0 


in contradiction to 


l= 


2, | p=0 g'(z,) 


Hence all divisors of A(z) (among them ./(z)) are extremal polynomials on 

11. If 2, 2, ..., 2m are not collinear then the zeros of A(z) lie interior to 
the polygon which is the convex hull of s. In case Zo, 21, ..., 2m are collinear, 
the convex hull is reduced to a line-segment o on which, between two con- 
secutive points of s, exactly one zero of A(z) will lie. By a well-chosen linear 


(9 | 

| 
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transformation of the z-plaxe o will be transformed into a segment of the 
real axis and, thus, A(z) (and its divisors) into polynomials with real co- 
efficients only. Now, under the condition that a set S consists of real 
points only and, therefore, all extremal polynomials C(z) on it have real 
roots and, consequently, real coefficients only, we can improve our Theorem 
II, namely we can replace in it the inequality n < m < 2n by the equality 
mM = ‘this is because of the possibility of simplifying our above lemma, 
subst: '1iting for the set ¢ 2, a set in /, of points with the coordinates 
= 2"~'p(z), = 2"-*p(z), ..., = p(z) when z ranges over S). The 
corresponding refinement of our main theorem leads to N(z) = 1, C(z) = 
A(z) and thus to the more precise result on the distribution of the zeros of 
C(z) in relation to the points in S as indicated above. 

12. The representation of all extremal polynomials C(z) of degree n as 
described at the beginning of Part II may yield still further information on 
the location of zeros of those polynomials: each theorem on the roots of 
equations of the type 


m Ay 
2 — Sy 


(10) 


combined with our main theorem is eventually a new source for such in- 
formation; the reader can find an excellent summary of the most interesting 
facts concerning (10) in ‘‘The Geometry of the Zeros of a Polynomial in a 
Complex Variable’’ by Professor M. Marden.* But a feature of the results 
so obtainable is that they presuppose the exact knowledge of the points z, 
appearing in (10) which we generally do not possess when the given set S 
and the extremal polynomial C(z) are arbitrary. Only under certain addi- 
tional assumptions about them, under the condition that they have 
properties of symmetry can we attack the problem with better chance of 
success. And here we shall rely upon representation of the extremal poly- 
nomials of modified type which utilizes such symmetries. 

III. Representation of Extremal Polynomials with Symmetry Proper- 
ties.—1. First we consider extremal polynomials C(z) = 2" + c2"~' + 
... + C,0n a given set S symmetric in the x-axis which are different from 0 
on S and have real coefficients only. By our main theorem, C(z) is the 
divisor of a polynomial A(z) of degree m,n < m & 2n, given by the formula 
(9). The subset s = {2, 2, ..., 2m} of S which occurs in (9) might be, 
however, non-symmetric in the x-axis and even if it is symmetric, it may 
happen that factors \, which belong to conjugate terms of (9) are different 
from each other. In this case the fact that C(z)|A(z) cannot be used to re- 
gain the already known laws on distribution of zeros of C(z) in the case now 
considered and to refine them as we have in mind. But we can easily find a 
polynomial 


u=0 

| 

| 

| 
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of degree m* © 4n + 1, divisible by wn where s* = {2o*, 2", ...,8%me} isa 
subset of S symmetric in the x- -axis; and ‘ AZ = 1, with AF = AJ > 0 when 


z, = 2; finally, g*(z) = i (z — 2). "te fact, the m equations (7) in 


theorem II hold, under our ied nt assumptions, together with the further 
n equations: 


m 2 
= 0); a=0,1,....”-1; 


x, C(2,) 


thus also 


m 
r ) 0; 
is valid, leading to 


i 
— 0; 0 < a < with Ay 


* C(z,) u=0 
>0; =A; when z, = 2 (12) 


where s* = {z,*, ..., consists of all (distinct) points of {2, ..., 
Zm, We deduce, from (12), the fact C(z)|A*(z) by the same arguments 
which lead from (7), to C(z)|A(z). 

2. This being settled, which means we have a representation for C(z), 
we derive the properties of the roots of C(z) from the corresponding results 
concerning A*(z). Now the laws of locations for the zeros of A*(z) in rela- 
tion to the subset s* of S are the same as for the zeros of derivatives of poly- 
nomials L(z) with real coefficients with their roots at 2)*, 2i*, ..., Sas 
(this is because of the fact that A*(z) can be uniformly approximated by 

m* * 
polynomials L(z) = where and a, are natural numbers, satis- 
op 
fying a, = da when z, = 2°). Hence the possibility of extending the 
theorems of Jensen and its various refinements by Walsh‘ concerning the 
distribution of zeros of the derivatives p’(z) of polynomials p(z) with real 
coefficients in relation to the zeros of the original polynomial p(z), to ex- 
tremal polynomials C(z) with real coefficients on a given pointset S sym- 
metric to the real axis in relation to this set. The reader may replace in 
those theorems assumptions on the roots of p(z) = 0 by assumptions con- 
cerning S and the statements about the roots of p’(z) = 0 by statements 
about the zeros of C/z). 

3. By arguments similar to the above can the case of symmetry of S 

and the zeros of the extremal polynomials, both with respect to the origin, 
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be treated and result about the distribution of roots obtained, analogous to 
facts concerning zeros of derivatives of polynomials possessing such sym- 
metry properties.’ We shall explicitly formulate but one result of this 
kind: 

Let S be a pointset symmetric in 0 and lie in the two closed circular 
regions 

Ri: |z — a| < rand R:: |z + <r 

where we have ja, 2 2r. Then all zeros of all (odd) extremal polynomials 
on S other than a simple zero at 0, lie in the closed regions R, and R». 

4. Similarly, results on zeros of derivatives valid under conditions of 
symmetry of higher order’ with respect to the origin can be extended to ex- 
tremal polynomials under corresponding symmetry assumptions. 


* The Institute for Advanced Study, Princeton, New Jersey. 

1 Jahresbericht d. Deutschen Math. Vereinigung, 31, 125-138 (1922). 

2 Cp. L. Fejér, “Uber die Lage der Nullstellen von Polynomen, die aus Minimum- 
forderungen gewisser Art entspringen,’”’ Math. Ann., 85, 41-48 (1922). 

3 Loc. cit. reference 1, pp. 135-137. 

4 The Location of Critical Points of Analytic and Harmonic Functions, Am. Math. Soc. 
Colloquium Publications, Vol. 34, 1950, Chapter IT. 

5 Loc. cit. reference 1, pp. 187-138. 

6 Am. Math. Soc. Mathematical Surveys, Number 111, Chapters I and II. 

7 Loc. cit. reference 4, Chapter IIT. 


ON THE BOUNDARY BEHAVIOR OF MINIMAL SURFACES 
By Hans Lewy 
UNIVERSITY OF CALIFORNIA 
Communicated by G. C. Evans, December 16, 1950 

This paper deals with the analytic extension of a minimal surface across 
its boundary. 

Lemma 1: Let U (x, y) be harmonic and absolutely bounded in the half 
plane y > O. Assume U (x, y) to possess! continuous boundary values 
U(x, 0) on y = O which admit limits U(@, 0) and U(—@, 0). Assume 
furthermore that the total variation from x = —” tox = @ of U(x, 0), 
denoted by [U(x, 0)], is bounded. Denote similarly by y) ] the 
total variation of U (x, y) as x ranges from a to b where a and b > a are any 
two numbers including + ©, and y is kept constant. Then, for y > 0, 

(U(x, y)] < T2.[U(x, 0)], 


and 


lim T2[U(x, y)] = T3[U(x, 


| 
i 
it 
| 
0 
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Proof: »U(x, y) = U(P) is for y > 0 expressed ‘by the formula 
U(P) = —« f(Q)d6(PQ) 


where 0(PQ) is the angle from the negative y-axis to the vector from P to 
Q, and f(Q) = U(Q) for Q on the x-axis. Integration by parts yields 


U(P) = PQ)df(Q) + (f(~) — f(—#))/2. 
Differentiating with respect to x we find 


which we also write with 6, instead of (0/0x)6(PQ) and where we note that 
—6, is positive. Thus 


(U(x, y)] = |dU(x, y)| SS — 0, dx\df(Q)|/*. (3) 
Now — 6,dx = — (—/2) = Hence the desired result 


Similarly 73[U(x, y)] < — 0(P2Q)]\df(Q)|, Pi = Pla, y), 
= P(b, y). The bracketed expression is bounded by 7m and tends uni- 
formly to ras y—> 0, fora +e < Q < b — ¢,andtoO for Q > b+ « or for 
Q <a — eife > Ois assigned sufficiently small. For the two missing inter- 
vals from a — e toa + e and from 6b — eto b + ¢ we gather from the con- 
tinuity of f(Q) which entails (as is familiar) that of T7{f(Q)], that their 
contribution to the last integral becomes for all y > 0 less than an arbitrary 
small e’ > 0 provided « is sufficiently small. Thus Jim S2(0(PiQ) — 


0(P2Q)||df(Q)| < |df(Q)| + we’ < <r + me’, whence 
lim y)] S T3{U(x, 0]. 


But since also lower semicontinuity holds, i.e., 


lim T3{U(x, y)] > 0)] 
0 
no matter how small e, hence also for ¢ = 0, we have the desired continuity 
of T2(U(x, y)] as y > 0. 
A theorem for the circle analogous to Lemma | was proved by Evans.’ 
Lemma 1’: Jf U,(x, vy), Us(x, vy), Us(x, y) are harmonic and absolutely 
bounded in y > O and assume on y = O continuous boundary values of 
bounded total variation, then the length L(y) of the curve of coordinates U,(x, y), 
Us(x, y), Us(x, y) for y fixed and — © <x < ©@ tends to the length Lo of the 
curve 0), U2(x, 0), Us(x, 0) = fil(Q), fo(Q), fa(Q), as O. 
Proof: wL(y) = © dx ((QUi(x, y)/Ox)* + (QU2/dx)? + 


q 
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— df(Q))? + ...1" 


since > 0. 
Now 
— 9:\dfi(Q)|)? < — + 


because this formula expresses the fact that a straight line joining the origin 
to the point with coordinates — 6,|df1(Q)|, — 92|df2(Q)|, 
— 6,|\df;(Q)| is the shortest line of this property. Hence 


rL(y) < dx — 0(\df,|? + ...)” 
and by Lemma 1, 
lim < lim dx — 0,((dfi(Q)|? + ...)'* = 
But the semicontinuity of the length requires 
lim L(y) > Lo. 


0 
Thus L(y) is continuous as y > 0, q. e. d. 
Lemma 2: Under the hypotheses of Lemma 1, the total variation of U on 
segments perpendicular to the x-axis is at most T=. [f(Q)]. 
Proof: It suffices to prove the case when the segment ends on the x-axis, 
at the origin. Write U = U, + U_, where ; 


U,(P) = — 0(PQ)df(Q) + const., 

U_= 273 f°. — 0(PQ)df(Q) + const. 
We have, with dP = dy > 0, dp0(PQ) = 6,dy < 0,if 6,dy > 0, if 
Q <0; |dU,(P)| fo” — d,6(PQ)|df(Q)|, |dU-(P)| 
dp(PQ) \df(Q)| 
Thus, integrating over the segment in question, 

S |dUs(P)| < 0(0Q)|df(Q)| = ('/2) So” |df(Q)| 
and similarly 
S |dU-(P)| < \df(Q)|. 

Hence 


S \dU(P)| < + S\dU(P)| < 


Lemma 3: Let S be a minimal surface of the three-dimensional £, n, ¢ 
space, and let S be bounded by a rectifiable Jordan curve C of lengih L. Take 
the conformal mapping of S on the upper half plane y > O of an x, y-plane 
which establishes a continuous one-to-one correspondence a between C and the 
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x-axis' and for which &, n, ¢ become bounded harmonic functions of x, y in 
y > Owith boundary values according to a, for which in y > 0 the two relations 
hold: 


+ + £26; 0 


with = (O/Ox)E, .... 

Denote by &*(x, y), n*(x, y), €*(x, y) conjugate harmonics of &(x, y), n, ¢ 
and call the surface S*, formed with these functions as coordinates of its 
points, the conjugate minimal surface. Denote by L(y) and L*(y) the length 
of the curves on S and S*, respectively, corresponding to fixed yand —~» < 
x < o, Then L(y) = L*(y) > L(O) = L*(0) as y— 0. 

Proof: Wf &*, n*, ¢* are taken so as to vanish at (0, 0), they become 
bounded harmonic functions assuming continuous boundary values of 
bounded total variation on the x-axis. In fact, let 0 < h’ < h, then 


[e*(a,h) — €*(a,h’)| < + + 57)" dy 
< Sil (G+ + 3)" dy 
< Sil (8 + + dy. 


Now by Lemma 2, fo” |&(a, y)|dy < T2..[&(x, 0)] < L(0). 
Hence "1 y)|dy converges and is <3L(0), thereby insuring that 


&* tends to a limit as y > 0. Similar facts hold for n*, ¢*. That these 
boundary values of £*, n*, ¢* are continuous follows easily from 


|e*(a, 0) — E*(b, 0)| < [&*(x, 0)] < lim inf 77 [&*(x, y)] 
< So + + dx 
‘im So (E+ + dx 
< length of C from x = atox = b, 


which also shows that the total variation of the boundary values of £* is 
continuous and remains bounded. Hence Lemma 1’ applies to &*, n*, ¢* 
as well as to £, n, ¢; Lemma 3 follows. 

Lemma 3’: Under the hypotheses of Lemma 3, we have more generally 

Lily) = La(y) > La(0) = 

Proof: Because of semicontinuity the only other possibility could be 

< lim inf L2(y). But then 
L(0) = L*..(0) + L5(0) + < lim inf L2.(y), 


which contradicts Lemma 1’. Analogous considerations exclude L**(0) < 
lim inf L*®(y), as y > 0 through an arbitrary sequence. 
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Lemma 4: Denote by L(O, y) the length of the boundary curve of the surface 
with coordinates cos y + &* sin y, n cos y + sin y, cos y + sin y. 
Then L4(0, y) = L3(0, 0) = L4(0). 

Proof: An elementary computation shows L(y, y) = L(y, 0), and the 
passage y — 0 gives, in view of Lemma 3’, the desired relation. 

LemMA 5: Denote by s the length on C. Excepting a set of measure zero 
with respect to s, we have on C 


O = (dé/ds)(d&*/ds) + (dn/ds)(dn*/ds) + (A) 
Proof: Since 
Li(0, = Li(0, 0) = [(d(é cos y + &* sin y)/ds)? + ... ds 
we find by differentiation with respect to y, setting y = 0, 
0 = «[(dt/ds)(dé*/ds) + ...|ds 


which is the integrated form of Lemma 5, from which it follows by differen- 
tiation with respect to s = s(b). 
THEOREM: L/nder the assumptions of Lemma 3 about S and its coordinates 
§, n, ¢(x, y), suppose furthermore that the boundary curve contains an 
analytic arc A which, without loss of generality, may be taken in the form 
= ¢ = ¥(E) with (0) = ¥(0) = ¢’(0) = = 0, and with and 
y analytic functions of the argument ¢ for sufficiently small values of || ; 


and let the origin of the &, n, ¢-space correspond to the origin of the x, y- 
plane. Then £&(x, 0), n(x, 0), ¢(x, 0) are analytic functions of x at x = 0 
and can be continued analytically near the origin across y = 0. 

With the exception, noted by Riemann, of straight line ares, no other 
‘example of an analytic curve has heretofore been shown to possess the 
property of automatically making the minimal surface analytically exten- 
sible across it.* 

Proof: Set 


(E+ i€*)/2, + in*)/2, = (6 + = x + fy. 


\, w, v(z) are analytic functions for y > 0, continuous in y 2 0. If we 
take the integral f |dd| + |du| + |dv|, extended over the x-axis, a parallel 
to the x-axis, or over the y-axis, it remains by Lemmas 1, 2, 3 bounded 
above by 6L(0). We now define the continuous function o(x, y) = 
S \dd| + |du| + {dv| by extending the integral from (0, 0) to (0, y) to 
(x, y) over straight segments. For sufficiently small h > 0, the square q 
of the corners —h, +h, h + 2th, —h + 2th has the property that within q, 
a(x, y) < efor givene > 0. 

Denote by p, M two positive numbers such that for the complex quantity 
= satisfying |=| < p, we have g(=) and ¥(Z) regular and 


\e’(Z)| < < 
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Take ¢€ so small that 
Me< 1/8and < p. 
We determine a function =(z) as solution of the differential equation 
dz = 2[1 + + {dd + + y'(Z)dv} (5) 


with Z(0) = 0.* Successive approximations give in the usual manner 
Z(z) by integrating over straight segments from 0 to ty to x + ty for all 
x + iy = zofq. To show this, we observe, calling =, the kth approxima- 
tion, that Z, = 0, and 


< | < 20. 
| Furthermore, suppose that |2,| < 40; then, by (5), 


< 2[l — do 
< 2[1 — 2M?.1607}-! do 
< 2[1 — 32(M e)?]-' do 
< 4do, 
and | < 4c. 
Thus =, exists in q for all integral k > 0. For a certain M’, to be de- 
scribed presently, we find 


|d| | < |d(EZx+2 Ent) | < M*| =,|do, 
whence 
|Z: — Zo| < 20, |Z. — < 2M*o*/2, — < 20(M*o)*/(k + 1)! 


Thus Zo + (=: — Zo) + (Z: — + ... has the majorant — 
and converges uniformly in g. Here M” is the maximum of |G(«) — 
; G(k’)|:|« — «’| for the following possibilities 
<p, = 2[L + + = Gilx), 
G(x) = Gilk)e’(k), 
G(x) = Gi(x)p’(k). 


Now set = = lim 2. We have in g the integrated form of (5), 


> @ 


= 2S [1 + {dr + + (6) 


since the Stieltjes integral on the right hand is the limit as k —* © of the 
analogous integral with = replaced by 2. 

That the construction yields an analytic function of z near all z = ty, 
y > 0, follows from the fact that (i) there is for arbitrary initial xo + ¢yo, 
yo > 0, a solution of (5) which is analytic, according to the classical exist- 
ence theorem and (ii) that there is no more than one solution of (5). For 
call 5= the difference of two solutions of (5). We have 
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< |3z\do 


with 6= vanishing initially. This inequality yields 5= = 0 in familiar man- 
ner. 

That the solution & is continuous in the whole of g follows from the same 
property of the =. The latter fact follows from the convergence of the 
total variations of \, wu, v over parallel segments which approach the x-axis 
normally. It is true that a priori (5) will hold on y = 0 only except for 
a set of measure zero relative to the variable s, but the integrated form 
holds everywhere. 

I say that = coincides with &(x, 0) for y = 0. In fact, we have on y = 0 
because n = g(§), ¢ = (&), that almost everywhere in the sense men- 
tioned, in view of (4), 


+ + + + dt/2 + 
i(dt*/2 + o'(E)dn*/2 + = dé. 


Moreover &(0, 0) = 0. Hence é is a possible solution of (5) on y = 0 
and the solution is unique as proved. 

Let 2 = x + ty be a point such that z = x — iyliesing. Put A(z) = 
=(2) — d(2). 
On y = 0, this new A coincides with the former definition. In fact, 


A(x) = E(x) — A(x) = E(x, 0) — (E(x, 0) — 1&*(x, 0))/2 
= (E(x, 0) + 0))/2. 


This new X(z) is regular, being the result of a succession of the two anticon- 
formal mappings z > 2 and 2— (2) — \(2). We can therefore apply the 
well-known theorem that A is regular within qg plus its image by reflection 
on y = 0, in particular on y = 0. 

uw and v can equally be continued across y = 0, thus establishing their 
analyticity at z = 0. We effect the extension by putting for y < 0, 
2=x + ty, 


u(z) = —p(2) + = —v(@) + 


First of all, these new definitions agree with the previous ones for yu, v on 
y = 0, since there 


—u(x) + o(E(x)) = —(n — in*)/2 + o(€) 
= (n + in*)/2 = u(x) 


and similarly 
—v(x) + ¥(E(x)) = v(x). 


Secondly, the same argument as before yields the fact that for y < 0, w and 
v are regular functions of z, thus becoming regular also on the x-axis. 
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The Theorem is proved. 

The idea underlying the analytic extension, of forming a differential 
equation connecting the extension with the given function, is capable of 
application in many other situations. We have treated similarly the 
problem of the ‘free boundary surface’ for minimal surfaces. Other 
problems concerning analytic boundary conditions come under the same 
heading. We intend to return to the subject elsewhere. 


! It suffices to demand assumption of boundary values upon normal approach to the 


boundary. 

2 Evans, G. C., The Logarithmic Potential, Am. Math. Soc. Colloquium Publica- 
tions VI, New York, 1927, pp. 35ff. 

* Courant, R., Dirichlet’s Principle, Conformal Mapping and Minimal Surfaces, In- 
terscience Publishers, 1950, esp. p. 118. 

* More precisely a solution of the integrated equation (6). 


THE CONNECTEDNESS OF THE ORTHOGONAL GROUP 
IN HILBERT SPACE 


By CALVIN R. PUTNAM AND AUREL WINTNER 


Tue INSTITUTE FOR ADVANCED STUDY AND DEPARTMENT OF MATHEMATICS, THE JOHNS 
HopkINns UNIVERSITY 


Communicated by Hermann Weyl, December 26, 1950 


1. A real Euclidean space (of finite dimension number) is orientable 
and, in particular, its orthogonal transformations form two distinct mani- 
folds, those of the rotations and of the reflections. More than twenty 
years ago one of us raised the question whether the real Hilbert space, ®, 
is ‘‘non-orientable" in the following sense: If Q denotes the space of all 
orthogonal transformations O of $k, then 2, when topologized by its natural 
metric (that corresponding to the strong topology of Jt) is a connected set. 
The results of the present note will imply that the answer to this question is 
in the affirmative, at least if connectedness is meant to be arewise connected- 
ness. 

It is understood that the points O of & are meant to be those infinite 
matrices which are real and unitary (hence bounded), that is, those satis- 
fying both O’O = EF and OO’ = EF, where the prime refers to transposition 
and / denotes the unit matrix. The distance, |O,; — O,|, between two 
points of Q will be meant to be the norm, | A}, of the bounded matrix A = 
QO, — Os, defined as the least upper bound of the vector length |Ax| when 
x varies over all vectors x of unit length in MR. 

This defines, in particular, the meaning of the statement that a set of 
points O = O(t),a S$ ¢ S 6, in Qis a continuous path in The theorem 
to be proved is that, corresponding to any pair of points Oi, O, in Q, there 


| 
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exists such a path satisfying O; = O(a), O. = O(b): In other words, every 
point of 2 can be connected, by a continuous are in &, with the ‘‘origin,”’ & 
of Q. 

2. It is well-known that a linear transformation of a real, finite-dimen- 
sional, Euclidean space is a rotation (that is, an orthogonal matrix of de- 
terminant +1) if and only if it is representable in the form e°, where S is a 
real, skéw-symmetric matrix. Furthermore, it is easily verified that if S 
is any infinite matrix which is real, bounded and skew-symmetric, then e 
isa point of 2. [tis therefore natural to define a point, O, of Q to be a rota- 
tion, R, of R if there exists an S = —.S’ satisfying R = e°, where S, as 
every matrix occurring in the sequel, is restricted to be bounded and real. 
The set of those points O of 2 which are rotations, R, will be denoted by 
{). The reflections of R can be defined by negation, that is, as those points 
of 2 which are contained in the complement of Q%. 

The space % of the rotations of 9% is arewise connected. In fact, if an 
R = eis given, then e>, where 0 < ¢ < 1, is a continuous are in Q and 
connects R with e’> = FE. 

In this regard, the rotations of behave as the finite-dimensional rota- 
tions. But while the latter form a group, the rotations of ¥ fail to do so. 
In other words, the product of two matrices contained in {% need not be 
in %. This can be seen as follows: 


If D, is any infinite diagonal matrix in which all but a finite number, n, 
of the diagonal elements are — | and the remaining m of them are +1, then, 


since the two-rowed matrix 


(0-1) 


is a rotation, it is clear that a D, isa rotation of 8. Similar considerations 
show that a — D, is a rotation if m is even. If m is odd, one will expect 
that —D, is a reflection. This happens to be true.’ In particular, both 
D, = —F and any matrix D, are rotations but their product, being a 
— D,, is a reflection. 

3. The fact that the rotations fail to form a group suggests the possi- 
bility that perhaps every orthogonal matrix is the product of a finite num- 
ber of rotations. If this turns out to be true, that is, if every O in 2 can be 
written in the form R,R, ... R,, where every R, is in % (the number, n= 
u(O), of the factors need not be a bounded function of O on %), then the 
arewise connectedness of 2 follows as a corollary. In fact, the path 
O(t) = where 0 < ¢ 1 and R, = e*', will then connect 
= Rik; ... R, = Owith O(0O) = EF. 

It turns out that not only do a finite » = u(O) and the yw rotations Ry = 
R,(O) exist for every O but that u(O) can be chosen independent of O. In 
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fact, u(O) = 3forevery O. In other words, every orthogonal matrix is the 
product of 3 rotations. It will remain undecided whether the 3 can be re- 
duced to 2. 

4. The proof for the sufficiency of 4 = 3 will be based on a result of 
M. H. Martin,? which is based on the spectral resolution® of an arbitrary 
unitary matrix, and states that every O can be factored as follows: O = 
Qe*, where S is skew-symmetric and commutable with Q, and Q is symmet- 
ric and Q? = E (and hence, Q is orthogonal). Actually, QS = SQ will not 
be used in the proof of up = 3. 

Since Q is orthogonal, the spectrum of Q must be on the boundary of the 
unit circle. It follows therefore from the symmetry of Q that the spectrum 
of Q, being on the real axis, cannot contain any point distinct from +1. 
Hence, Q has no continuous spectrum and is therefore, by the Hilbert- 
Hellinger theory of orthogonal invariants, orthogonally equivalent to a 
diagonal matrix. All diagonal elements of the latter are + 1, of course. 

5. If Qisa rotation, then O = Qe° shows that O is a product of 4 = 2 
rotations (hence, of «4 = 3 rotations as well, since the third can be taken to 
be £). Consequently, it is sufficient to show that if Q is a reflection, then 
it is the product of two rotations. 

According to the end of Section 4, the matrix Q is orthogonally equiv- 
alent to a diagonal matrix, D, in which all diagonal elements are = 1. Since 
every orthogonal matrix which is orthogonally equivalent to a rotation is 


again a rotation (i.e., of the form e°), it follows that D is a reflection matrix 
and that Q will be proved to be the product of two rotations if it is shown 
that D is the product of two rotations. It follows, however, from the rules 
of Section 2 that D is a —D, (where, in fact, m is odd) and can therefore 
be represented as the product of the two rotations D, and Dy. 


§ Martin, M. H., Am. J. Math., 54, 592 (1932), 
* See reference 1, p. 590. 
3 Wintner, A., Math. Ztschr., 30, 268-277 (1929). 
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A CLASS OF SOLUBLE DIOPHANTINE EQUA TIONS 


By MorGAN WARD 
DEPARTMENT OF MATHEMATICS, CALIFORNIA INSTITUTE, PASADENA 


Communicated by E. T. Bell, November 21, 1950 


1°. Let R be a commutative ring with a unit element, F(x) a homogene- 
ous polynomial of degree n in ¢ indeterminates x), X2, ..., x With coefficients 
in R. Let J denote the subring of the coefficients of F(x) in R; that is, the 
smallest ring containing all of them. We consider the existence of solutions 
of the diophantine equation 


F(x) = 2” (1) 


in Rorin/. Here zis an indeterminate and m is a given positive integer. 
If y1, Ye, ..., ¥, are ¢ new indeterminates and if there exist ¢ + 1 poly- 
nomials Q(y); P,(y), (¢ = 1, ..., 4), with coefficients in R (or in J) such that 


F(P(y)) = Q(y)* (2) 


identically in the y, (1) will be said to have a ¢-parameter family of solutions 
in R (or in J). 
2°. Turorem. If m is prime to the degree n of F(x), then the diophantine 
equation (1) always has a t-parameter family of solutions I both in R and in I. 
For assume that m is prime to n. If m is less than n, write r for m. 
Then integers k and / exist uniquely determined by m and r such that 


kn +1 =I, 
Define polynomials P(y); Q(y) by 
Py) =» FQ), Oy) = 


Then the coefficients of the P(y) and Q(y) lie in. Since F(x) was assumed 
to be homogeneous of degree n, (2) holds identically in the y with m equal to 
r. 

If m is greater than n, divide m by n and let the quotient be g and the re- 
mainder r. Then if m is prime to n, 


m= qn+r, O<r ian, r prime to n. 
With k, 1, P(y) and Q(y) as before, let 
Then F(y*) = Q(y)’. Hence if 


P,*(y) y*Q(y)* (i 1, t) 
Q*(y) = Q(y), 
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‘then F(P*(y)) = Q*(y)™ identically in the y. Since the polynomials P*(y) 
and Q*(y) have their coefficients in J, the proof is complete. 

3°. The most interesting case of this theorem is when J is the ring of 
ordinary integers. For example the diophantine equation 


x" y" = 


has a two parameter family of integral solutions for every m prime to n; 
the diophantine equation 


x* + y* + 24 = 2” 


has a three-parameter family of integral solutions for every odd m, and so 
on. Many other special cases occur in the literature.! 

4°. The family Y of solutions of (1) in R consists of vectors [é; 7] = 
[é1, £2, ..., of the form 


Pla), n = Qa) m<n, 
t= P*(a), n=Q*(a) m>n. 


Here a@ stands for ¢ arbitrarily chosen elements a, ..., a, of Ror of J. If 
the a are such that F(a) = 0, we obtain the trivial zero solution of (1) and 
this is evidently the only solution of the family 2 with n = 0 if R has zero 
radical. In any event the solutions of (1) in R with z = O are entirely 
independent of the choice of m. 

5°. If Risa field, it is easy to show that every solution [x, A] of (1) in R 
with \ # 0 is of the form 


x, = 6, (i = 1,2, ...,0); On. 


Here [é; n] belongs to the family IN, a and 6 are positive integers depend- 
ing only on m and n, while @ is a field element depending only on \. Thus 
in this case, I gives essentially all solutions of (1) with z ¥ 0. 

6°. The situation is quite different for the solutions I in J if J is a 
domain of integrity. Yt by no means exhausts the possible solutions of (1) 
in J; in fact the components £, 7 of any IM solution will usually have com- 
mon factors in J, For example, if J is the ring of integers, the diophantine 
equation 


has a two-parameter family of integral solutions [&, &, 7] for every odd 
prime m other than three. But the existence of a single integral solution 
with &, £ co-prime [other than the trivial solutions (1, 0; 1), (0, 1; 1)] 
would disprove Fermat’s last theorem. 


' Dickson, History of the Theory of Numbers, Vol. 1. 
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THE DETERMINATION OF CRYSTAL STRUCTURES FROM 
X-RAY DATA WITHOUT A KNOWLEDGE OF THE PHASES OF 
THE FOURIER COEFFICIENTS* 


By Dan McLACHLAN, JR. 
DEPARTMENT OF MINERALOGY, UNIVERSITY OF UTAH 


Communicated by Henry Eyring, December 20, 1950 


Introduction.—Since the development by Patterson! in 1935 of a method 
of making use of | F,,.|? values in the determination of crystal structures, ex- 
perimenters have been constantly confronted with the interpretation of 
‘Patterson structures.’’ In 1936 Harker’ pointed out the convenience of 
investigating certain special sections through three-dimensional Patterson 
structures depending upon the space group. One of the first attempts to 
devise a systematic procedure by which the interpretation of Patterson 
structures might be made easier was published by Wrinch® in 1939, using 
“vector maps.’ In April, 1950, two papers were read before the American 
Crystallographic Association which were somewhat related to the work of 
Wrinch but, from the standpoint of their results, were more related to each 
other. One of these papers was by Buerger‘ in which “vector sets’’ were 
developed. The content® of this paper appeared in March, 1950, and the 
extensions® of the method were published in July, 1950. The second paper 
of this date was by McLachlan’ who developed the “mixed projections” on 
the basis of the theory of the behavior of an optical machine built at the 
University of Utah, for the production of theoretical projections® of several 
kinds. The ‘mixed projections’ are equivalent to the addition of two 
translated Patterson structures; and the Fourier method of using them 
was demonstrated. Very recent work has been reported by Garrido® and 
Clastre."° 

The present paper has two objectives: (a) to present the mixed projec- 
tions in a more concise form, and (6) to introduce the multiplication of 
translated Patterson structures. The multiplication of Patterson struc- 
tures as contrasted to the addition of them has the advantage that no non- 
coincidental maxima appear on the final projection, and therefore the 
crystal structure is deduced without the consideration of ghost atoms. 

The Additive Property of Scattering Functions.—li a configuration of 
electrons (such as an atom in a crystal) has a scattering factor f,(Ak/) and 
is located at the position x,y,z; = 000 in an otherwise empty crystal, then 
the structure factor of the crystal is Pa.) = fi(hkl). If this atom is trans- 
ferred from the origin to a more general position x,y,2%,, the new crystal 
structure factor is obtained from the old one by multiplying it by a factor, 


+ ky + isi) 


/ 
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F(hkl) = wm + + 


If a second atom is located in a second crystal, a similar expression is used, 
and so on for the third, fourth, etc. Now, if all these atoms 1, 2, 3... 


+ + + ky) + + ky) = Fryy. 
FIGURE 1 


The addition of atoms to forma structure. (The fact that single atoms may 
be made equivalent by a shift of origin is being ignored here.) 


were crowded into the same crystal at their respective positions, +1121, 
XoVo%, XyVa2Z ..., the final structure factor of the composite crystal is the 
sum of the structure factors of the separate cells as illustrated in figure 1. 

The two facts described in the last paragraph ((1) that a translation of 
the position of a configuration of electrons in a crystal requires that the 


12 


/ 
/ 


/ 
/ 
/ 


¥ 


fi 20i hx: + fy has 4 = f,F*, or |F|?. 
FIGURE 2 


A process for getting Patterson structures by inflating the atoms in real structures 
and then adding. (Note that although the equation ts in one dimension and the figures are 
in two dimensions, the principle applies to three dimensions.) 


structure factor must be multiplied by an exponential function of the dis- 
tance and direction of translation, and (2) that the structure factor of a 
crystal containing many configurations is the simple sum of the structure 
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factors of the separate configurations) would be trivial if they were con- 
fined to atomic configurations only. However, these two facts are general 
and apply to the translation and congestion of molecules, groups of mole- 
cules or entire cell contents. 

The Concept of the ‘‘Squared’’ Crystal.—If, for example, each atom in the 
final configuration of three atoms |, 2 and 3 as shown in figure | is inflated, 
so to speak, so that their respective structure factors are fifi, fife and fifs, 
then we have a configuration of ‘‘squared’’ atoms, and the crystal struc- 
ture factor is fi Fax. Then if this configuration of squared atoms is trans- 
lated so that atom | is at the origin (see Fig. 2), the new crystal structure of 
translated squared atoms is 


ht ki 
i+ kh + 


\ 


/ 
ae 


| Fr h(x + | Fr h(x2 — x1) + fifs| Pr — xVetc. = | Fr 
FIGURE 3 


Indicating the manner of producing a quadrupled structure from squared structures. 


To get a “‘squared”’ crystal this process must be repeated as many times as 
there are atoms, n, in the original structure and all the translated structures 
collected, without distortion, into one cell. Using this concept, there are 
n terms in the ‘‘squared’’ crystal structure factor, and, as has been shown 
by Patterson,' the structure factor of the squared crystal is | Fy,)!?. 


+ kyt + > > 


extensions Beyond Squared Structures.—lf each of the squared atoms in 
the squared structure is inflated so that its new scattering factor is in- 
creased by a factor f,f, and translated so that the “‘atom’’ 7s is at the origin 
then we have a squared structure of ‘quadrupled’ atoms. Repeating this 
process (as shown in Fig. 3) as many times as there are squared atoms 
produces a “quadrupled” structure or a quadrupled crystal, and if it were 
to exist it would scatter x-rays with an amplitude, Fy)‘. The other crys- 


| 
| | | | 
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tals such as the tripled crystal will not be discussed here because they do not 
give fruitful results. In fact, it will be shown that there is no advantage in 
completing the process of generating a quadrupled crystal because the re- 
sults desired are found before completion. 

A Convenient Symbolism for Treating Structures and Structure Factors.— 
Let the m atoms in a crystal be numbered serially 1, 2,3, ... 7 ... , their 
positions be designated x,y,z, and their atomic structure factors be J, B 
it is assumed that the knowledge of the serial number of an atom implies a 
knowledge of its position and scattering factor, then we can abbreviate our 
treatment considerably. For example, a structure comprising three atoms 
is simply represented by 


TABLE 1 


12(11) | 13(11) | 12(12) | 13(12) 13(13) | 12(13) | 13(13) 


11(11) 
21(11) | 22(11) | 23(11) 21(12) | 22(12) | 23(12) 21(13) | 22(13) | 23(13) 


31(11) | 82(11) | 38011) 31(12) | 32(12) | 33(12) 31(13) | 32(13) | 33(13) 


11(21) | 12(21) | 1321) |} 1122) 12(22) | 1322) || 11(23) | | 13(23) 
21(21) | 22(21) | 23(21) || 21(22) | 22(22) | 23°22) || 21°23) | 22(23) | 23(23) 
31(21) | 32(21) | 33(21) |} 3122) | 32(22) | 33(22) |} 31(23) | 32(23) | 33/23) 


11(31) | 12(31) | 13(31) |} 11032) | 12°32) | 18°32) 11(33) | 12(33) | 13(33) 
21(31) | 22(31) | 23(31) |} 21(32) | 22°32) | 23°32) || 21°33) | 22°33) | 2333) 
31(31) | 32(31) | 33(31) |} 31(82) | 32°32) | 33°32) |] 31°88) | 32(33) | 33(33) 


If an atom j is inflated so that its scattering factor is increased by the factor 
f, the new scattering factor is fjf,; and if also the atom j is translated a 
distance x; — X1, ¥y — Yu 2) — 2%, then the symbol jt may be used to express 
both the inflation and the translation. For example, if the entire structure 
of three atoms 1, 2, and 3 is translated so that atom | is at the origin (see 
Fig. 2) and inflated by f, then the new structure is represented by 


Following this scheme, a Patterson (or squared crystal) of three points is 
represented by 

11 12 13 

21 22 23]. 

31 32 33 


| 
| 
| 
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Similarly the appearance of four numbers j/(rs) means a quadrupled atom 
of scattering factor fif,f,f, at the position (x, — 44) — (x, — x,). For ex- 
ample, if a Patterson structure is inflated by a factor fi f, and translated a 
distance —(x, — X2) the new structure is represented by 


(11 12 13 /11(12) 12(12) 13(12) 
22 23/12 = | 21(12) 22(12) 23(12)). 
31 32. 33) ;31(12) 32(12) 33(12) 
The continuation of this symbolism to represent a quadrupled structure 
is shown in table 1. Table 1 may be simplified somewhat (see table 2) by 
application of the following reductions: 
=0 if 
when +r 
when j 
when 
when 7 
when j 


TABLE 2 


12 21 2 31 32 | 0 
0 21(12) |: 23(12) 21(13) | 31 
32 31(12) | 31 | 21 31(13) | 32(13) 


12(21) o | 2 13 32 | 12(23) | 
12 21 | 23 31 | 32 
32(21) 0 31(23) | 32(23) | 
12(31) 3 | 13(32) 12 | 
13 23 :23(32) 0 
12 | | 28 32 


TABLE 3 


21 | 0 23 


13 
23 21(12) 21 23(12) 
0 


31 32 312) 31 21 


The Coincidences in Structures.—We notice that a structure of three atoms 
produces 9 terms in the table of the squared structure and 81 terms in the 
table of the quadrupled structure. In general, the number of terms in an 
m-fold structure is n”, if there are m atoms in the original structure. The 
squared structure has m coincidences at the origin as represented by the 
zeros along the diagonal of its table of numbers. The quadrupled structure 
of three atoms has 15 coincidences at the origin (see table 2) or in general 
2n? — mcoincidences. Also in the quadrupled structure there are n(n — 1) 


- 
0 
21 : 
31 
12 12 
0 0 
32 | 32 
| 
13 | 13 
23 | 23 
0 | 0 
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non-zero positions of coincidence and each of these occur n*:+ n — | times, 
thus producing a Patterson structure of exaggerated atoms as a part of the 


quadrupled structure. 
It is of more value to consider less than an entire quadrupled structure. 


Let us consider just the coincidences in the first two groups of table 2, as 
shown in table 3. 


FIGURE 4 
The addition of two Patterson structures to produce coincidences. 
The coincidences in table 3 are 0, 21, 31 and 23, and may be shown (see 
Fig. 4) to be the configuration of the original structure and its inversion as 
follows: 


| 2 | 
(21; 1 | 
31, ‘ 31 
23) 23(1) 23} | 3 


From figure 4 it may be seen that 0 and 21 are the common points of the 


structure | 2 3 and its inversion 123. The coincidences are 2n — 2 in 


number. 


TABLE 4 


0 23 || 
21(12) 2113) | 31 | 21 
31(12) | 31 21 3143) 323) | 31 


The Use of Triple Coincidences.—Three groups of table 2, if properly 
chosen, produce a set of triple coincidences whose configuration is that of a 
single original pattern as revealed by table 4. 

The choice of the three maxima from a Patterson summation requires 
some consideration because the third maximum may not be consecutive 


to the first two. 


~ 
| 
| 
4 
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31 | 32 | || 
| 
| 


Von. 37, 1951 PHYSICS: D. McLACHLAN, JR. 121 


Spectal Cases.—When a structure has a center of symmetry, the choice 
of three maxima in the Patterson summation is not necessary. Provided a 
through center vector is chosen to combine with the zero maximum, then 
there is only a single original structure produced. 

Table 5 shows the Patterson projection of four atoms 1,2,1,2 combined 
with a Patterson projection translated the distance 2 2. In this case there 
are just the coincidences 21, 22, 12,0; and adding 2 to these produces the 
original four numbers: 1, 2, 12. 

Special cases that are not so fortunate occur when a vector of length 
equal to one half the distance through the unit cell are present in the Pat- 
terson structure. It can be easily verified by making a table similar to 
table 5 that such a translation causes ambiguities to appear. 

The Multiplication of Squared Structures.—lf two Patterson structures 
are added together after translation, as indicated above, then the single or 


TABLE 5 
2: 2 11(22) 
21 21 22 2: 12 
0 ‘ 2(2: 22 
206 | «622  21(2% 22(2: 12(22) 


non-coincident maxima show up as well as the coincidences, because 
A(uvw) + 0 = A(uvw); but if they are multiplied together the non-coinci- 
dent maxima do not show up because 


A(uvw) X 0 = 0 


The Fourter Treatment of the Addition Process.—The addition process 
implies the sum of a Patterson structure represented by A(uvw) and a 
translated Patterson structure : 


A(u + Ary, 0 + w+ Az,,). 


To conserve space the one-dimensional terminology is used 


(F,)Me~ wthu 
h 


| 42 
0 
21 
22 
where 
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Since all Patterson structures are symmetrical it is more convenient to 
use the reduced equation 


h=@ 
A(u) + A(u + Axy,) = 2 |F,|? [cos 2mhu + cos 2nh(u + Ax;:)] 
= 0 


It is a further convenience to consider that the two cells are translated in 
opposite directions a distance '/2 Ax,;,, in which case the useful equation is 


h=o@ 


A(u + '/2Axy,) + A(u — '/2Ax,,) = 2 (Fr) cos 24 hu 
where 
= 2\ F,|? cos mh 


FIGURE 5A FIGURE 5B 


FIGURE 5A, The Fourier projection of hexamethylene-diamine di-hydrobromide by 
the addition method. 


FIGURE 5B. The theoretical projection of hexamethylene di-hydrobromide. Notice 
how the noncoincident peaks are related to the background in Figure 5a. 


This equation implies that if a Patterson summation is made in the usual 
way and a maximum corresponding to (Ax Ay Az),, is chosen for translation, 
then the experimental | Faxal? values may be multiplied by 


2 cos r(hAxy, + + 


to produce a new set of structure factors (Fx). Then the Fourier summa- 
tion is carried out in the standard manner, thereby solving a structure with- 
out a knowledge of the phases of the coefficients, Fy.:. Figure 5a shows a 


a” @ /O 
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two-dimensional example using the data of Binnie and Robertson" from 
hexamethylenediamine di-hydrobromide. In this projection the bromine 
atoms are shown in the same positions that the original authors found 
them. Figure 5b shows the theoretical added projection. 

The Fourier Treatment of the Multiplication Process—The product of 
two Patterson structures in one dimensions is 


= AL and | Fy’|? with + 
hh’ 


where 
(Fy + w)p = |Fal? and 


This summation, although comprising the same procedure (after solving 
for the (F, + »-)p values) as does the usual Fourier summation, involves 
more terms. Since our machine! goes only to hyo, = Rmax = 9, another, 
but only approximate, procedure has been adopted. The numbers in a 
60 X 60 Patterson projection are multiplied together at a vector distance 
indicated by one of the maxima on the Patterson projection. This latter 
method has very distinct disadvantages because the maxima seldom fall 
exactly on the divisions. However, since the methods of addition and 
multiplication are both expressed by Fourier equations, they should be 
adaptable to computation on such machines as the X-RAC."4 
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PRIOR LEARNING AS A FACTOR IN SHAPING PERFORMANCE 
CURVES* 


By Don Lewis AND ALFRED H. SHEPHARD 
STATE UNIVERSITY OF IOWA 


Communicated by W. R. Miles, November 23, 1950 


Introduction. Performance curves depicting the course of human learn- 
ing or acquisition of skill have been found, under various experimental cir- 
cumstances, to be negatively accelerated, S-shaped, linear, hyperbolic, etc. 
Attempts have been made from time to time to prove that the érue curve 
or function is of some particular form. Most commonly perhaps, the 
“true’’ function is thought to be negatively accelerated throughout and to 
have a kinship to the growth function, but strong arguments for functions 
of other forms, and especially for an S-shaped function, have been presented. 


FIGURE 1 


A careful consideration of results obtained in a large number of repre- 
sentative studies leads to the conclusion that there is no single ‘‘general-- 
ized” learning function, no ‘‘true’’ curve of learning or performance. Pre- 
sumably, the different curves presented by different investigators have not 
been segments or incomplete parts of some ‘‘true’’ function, but instead, 
have each reflected the course of learning as determined by many different 
factors, some of them inseparable from the learning process itself. 

MeGeoch,' in his exhaustive treatment of human learning, lists and dis- 
cusses,six principal determiners of curve-form. They are: 


| COUNTER ANCE 

| | 
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Units of measurement. 

Character of material or activity learned. 
Characteristics of subjects. 

Transfer from prior learning. 
Interference effects and forgetting. 
Physiological limit. 


McGeoch suggests that transfer from prior learning plays a highly signifi- 
cant role in shaping learning curves, but admits that supporting evidence 
has been generally lacking. 

More or less by accident, striking evidence of the role of transfer in shap- 
ing performance curves for a complex motor task was recently obtained in 
an investigation of individual susceptibility to negative transfer effects. 
This evidence will be summarized here. 


FIGURE 


The lasks That Were Learned.—The two motor tasks on which skill was 
acquired were both available on the Turret Pursuit Apparatus,’ a hy- 
draulically driven device providing a target which moved along an irregular 
pathway and a spot of light which subjects learned to keep on the target 
through a manipulation of manual controls. A schematic drawing of the 
apparatus is shown in figure 1. A subject, seated in the tub, grasped the 
two handles shown in figure 2—a close-up view of the control unit. The 
entire control unit could be turned part of a revolution on a vertical axis, 
and the handles could be turned a short distance on a horizontal axis. The 
yoke on the tub carried a light source, as shown in figure 1. A spot of light 
was projected from this source. The spot, which a subject was to keep on 
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the target, moved sidewise whenever the tub rotated and vertically when- 
ever the yoke moved up or down. 

The yoke was caused to move when the nandles on the control unit were 
turned, With the standard arrangement of the controls, the yoke and spot 
of light moved upward when the handles were pushed downward (thumbs 
moving toward the body), and moved downward when the handles were 
pulled upward (thumbs moving away from the body). Rotary (azimuth) 
movements of the tub arose from the turning of the whole control unit. 
With the standard control arrangement, pushing ahead on the left handle 
and pulling backward on the right caused the tub to move in a clockwise 
direction and the spot of light to the right. Pushing on the right handle 
and pulling on the left caused a counter-clockwise movement of the tub 
and a leftward movement of the spot of light. The speed of the tub’s 
azimuth movements and of the yoke’s elevation movements (and the 
accompanying speeds of sidewise and vertical movements of the light) were 
determined by the amount of displacement of the controls from their mid- 
point positions. 

As seen by the subject, the target was a somewhat darkened circle 1'/s in. 
in diameter, located in the center of a white circular plate 12 in. in diameter, 
mounted on the end of along boom. The target spot was a circular opening 
in a small container holding a photocell. The general location of the boom 
and target plate may be seen in figure 1. The forward end of the boom and 
the target were moved through an irregular course by the action of a double 
cam mechanism. As already indicated, it was the subject's task to manipu- 
late the controls so as to keep the spot of light on the moving target. The 
spot at the point of the target was about one inch in diameter. The photo- 
cell circuit was closed whenever the spot of light was more than half-way 
over the target. A record was kept of time on target, which was the total 
time during a practice trial that the photocell circuit was closed. 

Through the use of special valves in the hydraulic system, it was possible 
to reverse the action induced by movements of the controls. This gave the 
so-called reversed arrangement of the controls. With the reversed arrange- 
ment, the usual movements of the controls gave rise to azimuth movements 
of the tub and elevation movements of the yoke which were exactly oppo- 
site in direction to those prevailing under standard conditions. A subject 
could be given practice on the apparatus with the controls either standard 
or reversed, that is, he could acquire skill in performing either the standard 
task or the reversed task. Actually, in the studies of individual suscepti- 
bility to negative transfer effects, the subjects first acquired skill on the 
standard task, then received practice on the reversed task, and were tested 
finally on their retention of skill on the standard task. The marked differ- 
ence between the performances of the same person on the two tasks led to a 
study in which a group of subjects first practiced on the reversed task be- 
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fore learning to perform on the standard task. The shapes of the resulting 
learning curves were strikingly different, as will now be shown. 

Procedures and Results.—Forty male students (Group A) were each given 
70 practice trials on the Turret, 10 trials on each of seven consecutive days. 
Each trial had a duration of 30 seconds and was followed by a rest interval 
of 30 seconds. A somewhat longer rest of two minutes was given each day 
between the 5th and 6th trials. For the first 30 trials (Days 1-3), the 
standard arrangement of the controls was used. In other words, the 
standard task was learned first. On Days 46, for a total of 30 trials, the 
controls were reversed. This practice constituted the interpolated learn- 
ing. Ten trials of relearning occurred on Day 7. 
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FIGURE 3 

Performance data for Group A are summarized in figure 3, where mean 
time on target in seconds is plotted against trials. There are separate 
graphs for the three major phases of learning. The curve representing the 
original learning of the standard task is negatively accelerated and might 
be regarded as similar, in general form, to many curves obtained under 
diverse learning conditions. The curve for the learning of the reversed 
task (middle graph of figure 3) shows a slight tendency to be positively 
accelerated but is generally linear in form. The relearning curve is pre- 
dominantly linear but has a slightly S-shaped appearance. It was the 
marked difference between the curve for the standard task and the curve 
for the reversed task that led to a second study. 
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Group B, composed of 14 male students, received a total of 200 practice 
trials on the Turret, 20 trials on each of ten consecutive days.’ The length 
of each trial and of the inter-trial rest interval was 30 seconds, as it had been 
for Group A. A rest of two minutes was given after each block of five 
trials. For the first 100 practice trials (Days 1-5), the reversed arrange- 
ment of the controls prevailed.‘ The learning of the standard task followed 
on Days 6-5, for 60 trials in all. Relearning of the reversed task occurred 
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FIGURE 4 


on Days 9-10, for 40 trials. 

The data are summarized in figure 4 where time on target in seconds is 
plotted against trials. There are separate graphs for the three phases of 
learning. The general trend of the performance curve for each phase is 
apparent. 

Discussion. The curve for original learning in figure 3 should be com- 
pared with the curve for interpolated learning in figure 4. Skill in per- 
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forming the standard task was being acquired in both cases. It is obvious 
that 100 practice trials on the reversed task had a very pronounced effect 
on the subsequent learning of the standard task. The curves under the 
two circumstances of learning were markedly dissimilar. In contrast to 
the negatively accelerated curve for the learning of the standard task by 
Group A, the curve for the learning of the same task by Group B has an un- 
mistakable S-shape. That the S-shaped appearance of this latter curve 
was not an artifact of the grouping of data for individual subjects is shown 
by figure 5 where the early portions of individual performance curves are 
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FIGURE 5 


presented for six of the 14 subjects. These are representative curves, and 
all display clear signs of positive acceleration in the learning of the standard 
task subsequent to the learning of the reversed task. 

Attention is called to the discontinuity in the interpolated learning curve 
for Group B between Trials 40 and 41 (between Days 7 and 8). The sharp 
drop in proficiency on Trial 41 with relatively slow recovery over several 
trials may have reflected a ‘need for warming up,”’ but it may have been 
related to a persistence of the effects of prior practice on the reversed task. 
Unfortunately, original learning on the standard task was not continued 
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through 60 trials for Group “. If it had been, a choice between the two 
possibilities might have been afforded. As it is, additional data must be 
obtained to provide the basis for a correct interpretation. 

A comparison of the curves for the first 30 trials of the learning of the re- 
versed task by Group B (Fig. 4) and by Group A (Fig. 3) shows them to be 
quite similar. Their similarity might lead to the conclusion that the prior 
learning of the standard task had no important bearing on the learning of 
the reversed task. In this connection, the question must be raised as to 
why the reversed task was initially far more difficult than the standard task. 
The greater difficulty may be safely inferred from the over-all level of per- 
formance throughout the learning sequence. 

The greater difficulty in acquiring skill on the reversed task cannot be 
explained in terms of any inherent aspect of the task itself, such as greater 
complexity or higher level of work output. Judged on the basis of com- 
plexity or expenditure of effort, the standard and reversed tasks should 
have been equally difficult to learn. The difference between them must be 
attributed to prior learning. With the standard arrangement of the con- 
trols, the movements required in turning the tub (that is, in moving the 
light sidewise) were basically similar to those used in steering a car, guiding 
a bicycle, or turning a boat. The movements employed in changing the 
vertical position of the spot of light were similar to those most persons have 
learned in sighting a gun or moving levers on machines. In contrast, the 
responses required when the controls were reversed were directly opposite 
to many responses which most persons have long been making. In fact, 
with the controls reversed, the required movements seemed freakish and 
bizarre.® 

There was every reason to believe that initial performance on the reversed 
task was strongly affected by prior non-laboratory everyday learning. 
This learning was so great in amount that the responses had become almost 
implicit in character. The small amount of practice given to Group A on 
the standard task prior to practice on the reversed task could not have had 
more than a superficial influence on the old and strongly established habits; 
all it might have done was slightly reinforced the habits. It is not surpris- 
ing, therefore, that the performance of the two groups on the reversed task 
was highly similar. Both groups began the reversed task with old and 
firmly established habits of responding in ways required for proficient per- 
formance on the standard task. 

An interesting incidental feature of figure 4 is the performance level 
attained at the end of relearning. On the 5th day of original learning (of 
the reversed task), the subjects improved very little and the curve was 
obviously leveling off at around 4.5 seconds. During relearning (subse- 
quent to 60 trials on the standard task), the reacquisition of skill was quite 
rapid, the average performance level on Trials 36-40 was better than 6.0 
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seconds, and at the termination of practice the general trend of the curve 
was still upward. The higher general level of proficiency on the reversed 
task following practice on the standard task may have been due to in- 
creased motivation; or the subjects may have learned something during 
interpolated practice which transferred to the reversed task. 

Summary.—Two groups of male subjects were given practice on the 
standard and reversed tasks provided by the Turret Pursuit Apparatus. 
The tasks were opposites, that is, they required opposite motor responses 
to the same pattern of stimuli. One group began practicing on the stand- 
ard task, went to the reversed task, and then relearned the standard task. 
The second group began on the reversed task, then practiced the standard 
task, and relearned the reversed task. 

The results showed unmistakably that, under laboratory conditions, 
transfer from prior learning is an important determiner of the shapes of 
performance curves. There was good reason for believing that transfer 
plays a significant role even when the prior learning is of the everyday 
variety occurring outside of the laboratory. 


* This work was done under Contract N9onr-93801 between the State University of 
Iowa and the Special Devices Center, Office of Naval Research. 

! McGeoch, J. A., The Psychology of Human Learning, Longmans, Green, and Co., 
New York, 1942. 

? Lewis, D., and Shephard, A. H., ‘Devices for Studying Associative Interference in 
Psychomotor Performance: IV. The Turret Pursuit Apparatus,” J. Psychol., 29, 173- 
182 (1950). 

3 Twenty trials per day were given instead of ten (as with Group A) in order to reduce 
the total number of consecutive days of practice. From a general knowledge of per- 
formance on the reversed task, it was predicted that the learning of it would be quite 
slow. 

On Days 1 and 2, for a total of 40 trials, half of the 14 subjects of Group B practiced 
the reversed task under special verbal instructions while the other half practiced under 
usual instructions. The special instructions were rather lengthy and provided specific 
information on how the controls should be turned and how they should not be turned for 
desired directions of movement of the spot of light. The purpose was to determine 
whether or not a set to avoid errors and to perform in the correct way could be induced 
by verbal means. The special instructions were read in full before practice began, and a 
condensed version was presented at the outset of Day 2. By the end of Day 2, the 
difference between the performances of the two subgroups was negligible. This led to 
combining them into the single group for which results are presented in figure 4. 

5 One of the subjects in Group B made a revealing comment at the end of his first day 
of practice on the reversed task. He had been told that the experiment was related to 
certain aspects of learning to operate Navy gun turrets. He said, “It’s a wonder we 
ever won the War if we used crazy things like that.” 
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